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Patent- og Varemaerkestyrelsen 



TITLE: Mutated prokaryotic cells with high secretion-levels -....) 

FIELD OF INVENTION 

The present invention relates to mutated prokaryotic cells, which secrete higher 
amounts of at least one heterologous polypeptide of interest and which have a reduced 
expression-level of YusZ or YusX, or homologues thereof, when compared with otherwise 
isogenic but non-mutated cells, and methods for constructing and using such cells in the 
production of polypeptides. 

BACKGROUND 

The yusZ and yusX DNA sequences were, first reported in 1993, but merely as 
putative open reading frames (Chen et al, 1993, Metalloregulation in Bacillus subtilis: isolation 
and characterization of two genes differentially repressed by metal ions, J Bact 175(17): 5428- 
5437). 

In a later publication it was speculated that yusX, and an open reading frame located 
immediately upstream of yusX, denoted yusY t could have arisen from a frameshift mutation in 
a single yusXY gene. However, no further investigation was carried out and the authors of the 
publication concluded that the function of the gene(s) in the cell remained unknown (Kanamaru 
et al, 2002, Overexpression of the PepF Oligopeptidase Inhibits Sporulation Initiation in 
Bacillus subtilis, J Bact 184(1): 43-50). 

DEFINITIONS 

In accordance with the present invention there may be employed conventional 
molecular biology, microbiology, and recombinant DNA techniques within the skill of the art. 
Such techniques are explained fully in the literature. See, e.g., Sambrook, Fritsch & Maniatis, 
Molecular Cloning: A Laboratory Manual, Second Edition (1989) Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (herein "Sambrook et al., 1989") DNA 
Cloning: A Practical Approach, Volumes I and II /D.N. Glover ed. 1985); Oligonucleotide 
Synthesis (M.J. Gait ed. 1984); Nucleic Acid Hybridization (B.D. Hames & S.J. Higgins eds 
(1985)); Transcription And Translation (B.D. Hames & S.J. Higgins, eds. (1984)); Animal Cell 
Culture (R.I. Freshney, ed. (1986)); Immobilized Cells And Enzymes (IRL Press, (1986)); B. 
Perbal, A Practical Guide To Molecular Cloning (1984). 

A "polynucleotide" is a single- or double-stranded polymer of deoxyribonucleotide or 
ribonucleotide bases read from the 5' to the 3' end. Polynucleotides include RNA and DNA, 
and may be isolated from natural sources, synthesized in vitro, or prepared from a combination 
of natural and synthetic molecules. 
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A "nucleic acid molecule" or "nucleotide sequence" refers to the phosphate ester 
polymeric form of ribonucleosides (adenosine, guanosine, uridine or cytidine; "RNA 
molecules") or deoxyribonucleosides (deoxy adenosine, deoxyguanosine, deoxythymidine, or 
deoxycytidine; "DNA molecules") in either single stranded form, or a double-stranded helix. 
Double stranded DNA-DNA, DNA-RNA and RNA-RNA helices are possible. The term nucleic 
acid molecule, and in particular DNA or RNA molecule, refers only to the primary and 
secondary structure of the molecule, and does not limit it to any particular tertiary or quaternary 
forms. Thus, this term includes double-stranded DNA found, inter alia, in linear or circular DNA 
molecules (e.g., restriction fragments), plasmids, and chromosomes. In discussing the 
structure of particular double-stranded DNA molecules, sequences may be described herein 
according to the normal convention of giving only the sequence in the 5' to 3' direction along 
the nontranscribed strand of DNA (i.e., the strand having a sequence homologous to the 
mRNA). A "recombinant DNA molecule" is a DNA molecule that has undergone a molecular 
biological manipulation. 

A nucleic acid molecule is "hybridizable" to another nucleic acid molecule, such as a 
cDNA, genomic DNA, or RNA, when a single stranded form of the nucleic acid molecule can 
anneal to the other nucleic acid molecule under the appropriate conditions of temperature and 
solution ionic strength (see Sambrook et al. f supra). The conditions of temperature and ionic 
strength determine the "stringency" of the hybridization. 

For purposes of the present invention, hybridization indicates that the nucleotide 
sequence hybridizes to a labeled polynucleotide probe which hybridizes to the nucleotide 
sequence shown in SEQ ID NO:1 , SEQ ID NO: 3, or SEQ ID NO: 5 under very low to very high 
stringency conditions. Molecules to which the polynucleotide probe hybridizes under these 
conditions may be detected using X-ray film or by any other method known in the art. 
Whenever the term "polynucleotide probe" is used in the present context, it Is to be understood 
that such a probe contains at least 1 5 nucleotides. 

In an interesting embodiment, the polynucleotide probe is the complementary strand 
of a fragment of at least 15 nucleotides of SEQ ID NO:1, SEQ ID NO: 3, or SEQ ID NO: 5. In 
another interesting embodiment, the polynucleotide probe is a fragment of at least 15 
nucleotides of the complementary strand of any nucleotide sequence which encodes the 
polypeptide of SEQ ID NO:2, SEQ ID NO: 4, or SEQ ID NO: 6! In a further interesting 
embodiment, the polynucleotide probe is the complementary strand of SEQ ID NO:1, SEQ ID 
NO: 3, or SEQ ID NO: 5. In a still further interesting embodiment, the polynucleotide probe is 
the complementary strand of the mature polypeptide coding region of SEQ ID NO:1, SEQ ID 
NO: 3, or SEQ ID NO: 5. 

For long probes of at least 100 nucleotides in length, very low to very high stringency 

conditions are defined as prehybridization and hybridization at 42°C in 5X SSPE, 1.0% SDS, 

5X Denhardfs solution, 100 microg/ml sheared and denatured salmon sperm DNA, following 
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standard Southern blotting procedures. Preferably, the long probes of at least 100 nucleotides 
do not contain more than 1000 nucleotides. For long probes of at least 100 nucleotides in 
length, the carrier material is finally washed three times each for 15 minutes using 2 x SSC, 
0.1% SDS at 42°C (very low stringency), preferably washed three times each for 15 minutes 
using 0.5 x SSC, 0.1% SDS at 42°C (low stringency), more preferably washed three times 
each for 15 minutes using 0.2 x SSC, 0.1% SDS at 42°C (medium stringency), even more 
preferably washed three times each for 15 minutes using 0.2 x SSC, 0.1% SDS at 55°C 
(medium-high stringency), most preferably washed three times each for 15 minutes using 0.1 x 
SSC, 0.1% SDS at 60°C (high stringency), in particular washed three times each for 15 
minutes using 0.1 x SSC, 0.1% SDS at 68°C (very high stringency). 

Although not particularly preferred, it is contemplated that shorter probes, e.g. probes 
which are from about 15 to 99 nucleotides in length, such as from about 15 to about 70 
nucleotides in length, may be also be used. For such short probes, stringency conditions are 
defined as prehybridization, hybridization, and washing post-hybridization at 5°C to 10°C below 
the calculated Tm using the calculation according to Bolton and McCarthy (1962, Proceedings 
of the National Academy of Sciences USA 48:1390) in 0.9 M NaCI, 0.09 M Tris-HCI pH 7.6, 6 
mM EDTA, 0.5% NP-40, 1X Denhardt's solution, 1 mM sodium pyrophosphate, 1 mM sodium 
monobasic phosphate, 0.1 mM ATP, and 0.2 mg of yeast RNA per ml following standard 

Southern blotting procedures. 

For short probes which are about 15 nucleotides to 99 nucleotides in length, the 
carrier material is washed once in 6X SCC plus 0.1% SDS for 15 minutes and twice each for 
15 minutes using 6X SSC at 5°C to 10'C below the calculated Tm. 

A DNA "coding sequence" or an "open reading frame (ORF)" is a double-stranded 
DNA sequence which is transcribed and translated Into a polypeptide in a cell in vitro or in vivo 
when placed under the control of appropriate regulatory sequences. The boundaries of the 
coding sequence are determined by a start codon at the 5' (amino) terminus and a translation 
stop codon at the 3' (carboxyl) terminus. A coding sequence can include, but is not limited to, 
prokaryotic sequences, cDNA from eukaryotic mRNA, genomic DNA sequences from 
eukaryotic (e.g., mammalian) DNA, and even synthetic DNA sequences. If the coding 
sequence is intended for expression in a eukaryotic cell, a polyadenylation signal and 
transcription termination sequence will usually be located 3' to the coding sequence. 

An expression vector is a DNA molecule, linear or circular, that comprises a segment 
encoding a polypeptide of interest operably linked to additional segments that provide for its 
transcription. Such additional segments may include promoter and terminator sequences, and 
optionally one or more origins of replication, one or more selectable markers, an enhancer, a 
polyadenylation signal, and the like. Expression vectors are generally derived from plasmid or 
viral DNA, or may contain elements of both. 
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Transcriptional and translational control sequences are DNA regulatory sequences, 
such as promoters, enhancers, terminators, and the like, that provide for the expression of a 
coding sequence in a host cell. In eukaryotic cells, polyadenylation signals are control 
sequences. 

A "secretory signal sequence" is a DNA sequence that encodes a polypeptide <a 
"secretory peptide" that, as a component of a larger polypeptide, directs the larger polypeptide 
through a secretory pathway of a cell in which it is synthesized. The larger polypeptide is 
commonly cleaved to remove the secretory peptide during transit through the secretory 
pathway. 

The term "promoter" is used herein for its art-recognized meaning to denote a portion 
of a gene containing DNA sequences that provide for the binding of RNA polymerase and 
initiation of transcription. Promoter sequences are commonly, but not always, found in the 5' 
non-coding regions of genes. 

A chromosomal gene is rendered non-functional if the polypeptide that the gene 
encodes can no longer be expressed in a functional form. Such non-functionality of a gene can 
be induced by a wide variety of genetic manipulations as known in the art, some of which are 
described in Sambrook et al. vide supra. Partial deletions within the ORF of a gene will often 
render the gene non-functional, as will mutations. 

The term "an expressible copy of a chromosomal gene" is used herein as meaning a 
copy of the ORF of a chromosomal gene, wherein the ORF can be expressed to produce a 
fully functional gene product. The expressible copy may not be transcribed from the native 
promoter of the chromosomal gene, it may instead be transcribed from a foreign or 
heterologous promoter, or it may indeed be promoterless and expressed only by transcriptional 
read-through from a gene present upstream of the 5' end of the ORF. Transcriptional read- 
through is intended to have the same meaning here as the generally recognized meaning in 
the art. 

"Operably linked", when referring to DNA segments, indicates that the segments are 
arranged so that they function in concert for their intended purposes, e.g. transcription initiates 
in the promoter and proceeds through the coding segment to the terminator. 

A coding sequence is "under the control" of transcriptional and translational control 
sequences in a cell when RNA polymerase transcribes the coding sequence into mRNA, which 
is then trans-RNA spliced and translated into the protein encoded by the coding sequence. 

"Heterologous" DNA refers to DNA not naturally located in the cell, or in a 
chromosomal site of the cell. Preferably, the heterologous DNA includes a gene foreign to the 
cell. 

As used herein the term "nucleic acid construct" is intended to indicate any nucleic 

acid molecule of cDNA, genomic DNA, synthetic DNA or RNA origin. The term "construct" is 

intended to indicate a nucleic acid segment which may be single- or double-stranded, and 
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which may be based on a complete or partial naturally occurring nucleotide sequence 
encoding a polypeptide of interest. The construct may optionally contain other nucleic acid 
segments. 

The nucleic acid construct of the invention encoding the polypeptide of the invention 
may suitably be of genomic or cDNA origin, for instance obtained by preparing a genomic or 
cDNA library and screening for DNA sequences coding for all or part of the polypeptide by 
hybridization using synthetic oligonucleotide probes in accordance with standard techniques 

(cf. Sambrook et al., supra). 

The nucleic acid construct of the invention encoding the polypeptide may also be 
prepared synthetically by established standard methods, e.g. the phosphoamidite method 
described by Beaucage and Caruthers, Tetrahedron Letters 22 (1981), 1859 - 1869, or the 
method described by Matthes et al., EMBO Journal 3 (1984), 801 - 805. According to the 
phosphoamidite method, oligonucleotides are synthesized, e.g. in an automatic DNA 
synthesizer, purified, annealed, ligated and cloned in suitable vectors. 

Furthermore, the nucleic acid construct may be of mixed synthetic and genomic, 
mixed synthetic and cDNA or mixed genomic and cDNA origin prepared by ligating fragments 
of synthetic, genomic or cDNA origin (as appropriate), the fragments corresponding to various 
parts of the entire nucleic acid construct, in accordance with standard techniques. The nucleic , 
• acid construct may also be prepared by polymerase chain reaction using specific primers, for 
instance as described in US 4,683,202 or Saiki et al., Science 239 (1988), 487 - 491 . 

The term nucleic acid construct may be synonymous with the term "expression 
cassette" when the nucleic acid construct contains the control sequences necessary for 
expression of a coding sequence of the present invention 

The term "control sequences" is defined herein to include all components which are 
necessary or advantageous for expression of the coding sequence of the nucleic acid 
sequence. Each control sequence may be native or foreign to the nucleic acid sequence 
encoding the polypeptide. Such control sequences include, but are not limited to, a leader, a 
polyadenylation sequence, a propeptide sequence, a promoter, a signal sequence, and a 
transcription terminator. At a minimum, the control sequences include a promoter, and 
transcriptional and translational stop signals. The control sequences may be provided with 
linkers for the purpose of introducing specific restriction sites facilitating ligation of the control 
sequences with the coding region of the nucleic acid sequence encoding a polypeptide. 

The control sequence may be an appropriate promoter sequence, a nucleic acid 
sequence which is recognized by a host cell for expression of the nucleic acid sequence. The 
promoter sequence contains transcription and translation control sequences which mediate the 
expression of the polypeptide. The promoter may be any nucleic acid sequence which shows 
transcriptional activity in the host cell of choice and may be obtained from genes encoding 

extracellular or intracellular polypeptides either homologous or heterologous to the host cell. 
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The control sequence may also be a suitable transcription terminator sequence, a 
sequence recognized by a host cell to terminate transcription. The terminator sequence is 

* 

operably linked to the 3' terminus of the nucleic acid sequence encoding the polypeptide. Any 
terminator which is functional in the host cell of choice may be used in the present invention. 

The control sequence may also be a polyadenylation sequence, a sequence which is 
operably linked to the 3* terminus of the nucleic acid sequence and which, when transcribed, is 
recognized by the host cell as a signal to add polyadenosine residues to transcribed mRNA. 
Any polyadenylation sequence which is functional in the host cell of choice may be used in the 
present invention. 

The control sequence may also be a signal peptide coding region, which codes for an 
amino acid sequence linked to the amino terminus of the polypeptide which can direct the 
expressed polypeptide into the cell's secretory pathway of the host cell. The 5* end of the 
coding sequence of the nucleic acid sequence may inherently contain a signal peptide coding 
region naturally linked in translation reading frame with the segment of the coding region which 
encodes the secreted polypeptide. Alternatively, the 5' end of the coding sequence may 
contain a signal peptide coding region which is foreign to that portion of the coding sequence 
which encodes the secreted polypeptide. A foreign signal peptide coding region may be 
required where the coding sequence does not normally contain a signal peptide coding region. 
- Alternatively, the foreign signal peptide coding region may simply replace the natural signal 
* peptide coding region in order to obtain enhanced secretion of the exoprotein relative to the «. 
natural signal peptide coding region normally associated with the coding sequence. The signal 
peptide coding region may be obtained from a glucoamylase or an amylase gene from an 
Aspergillus species, a lipase or proteinase gene from a Rhizomucor species, the gene for the 
alpha-factor from Saccharomyces cerevisiae, an amylase or a protease gene from a Bacillus 
species, or the calf preprochymosin gene. However, any signal peptide coding region capable 
of directing the expressed polypeptide into the secretory pathway of a host cell of choice may 
be used in the present invention. 

The control sequence may also be a propeptide coding region, which codes for an 
amino acid sequence positioned at the amino terminus of a polypeptide. The resultant 
polypeptide is known as a proenzyme or propolypeptide (or a zymogen in some cases). A 
propoiypeptide is generally inactive and can be converted to mature active polypeptide by 
catalytic or autocatalytic cleavage of the propeptide from the propolypeptide. The propeptide 
coding region may be obtained from the Bacillus subtilis alkaline protease gene (aprE), the 
Bacillus subtilis neutral protease gene (nprT), the Saccharomyces cerevisiae alpha-factor 
gene, or the Myceliophthora thermophilum laccase gene (WO 95/33836). 

It may also be desirable to add regulatory sequences which allow the regulation of the 
expression of the polypeptide relative to the growth of the host cell. Examples of regulatory 

systems are those which cause the expression of the gene to be turned on or off in response 
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to a chemical or physical stimulus, including the presence of a regulatory compound. 
Regulatory systems in prokaryotic systems would include the lac, tac, and trp operator 
systems. 

Examples of suitable promoters for directing the transcription of the gene(s) of the 
present invention, especially in a bacterial host cell, are the promoters obtained from the E. coli 
lac operon, the Streptomyces coelicolor agarase gene {dagA) t the Bacillus subtilis 
levansucrase gene (sacB), the Bacillus subtilis alkaline protease gene, the Bacillus 
licheniformis alpha-aniylase gene (amyL), the Bacillus stearothermophUus maltogenic amylase 
gene (amyM), the Bacillus amyloliquefaciens alpha-amylase gene (a/nyQ), the Bacillus 
amyloliquefaciens BAN amylase gene, the Bacillus licheniformis penicillinase gene (penP), the 
Bacillus subtilis xylA and xylB genes, and the prokaryotic beta-lactamase gene (Villa-Kamaroff 
et al. a 1978, Proceedings of the National Academy of Sciences USA 75:3727-3731), as well as 
the tac promoter (DeBoer et al., 1983, Proceedings of the National Academy of Sciences USA 
80:21-25). Further promoters are described in "Useful proteins from recombinant bacteria" in 
Scientific American, 1980, 242:74-94; and in Sambrook et al., 1989, supra. 

An effective signal peptide coding region for bacterial host cells is the signal peptide 
coding region obtained from the maltogenic amylase gene from Bacillus NCIB 11837, the 
Bacillus stearothermophUus alpha-amylase gene, the Bacillus licheniformis subtilisin gene, the 
• Bacillus licheniformis beta-lactamase gene, the Bacillus stearothermophUus neutral proteases 
genes?(nprT, nprS, nprM),jand the Bacillus subtilis PrsA gene. Further signal peptides are 
described by Simonen and Palva, 1993, Microbiological Reviews 57:109-137. 

The present invention also relates to recombinant expression vectors comprising a 
nucleic acid sequence of the present invention, a promoter, and transcriptional and 
translational stop signals. The various nucleic acid and control sequences described above 
may be joined together to produce a recombinant expression vector which may include one or 
more convenient restriction sites to allow for insertion or substitution of the nucleic acid 
sequence encoding the polypeptide at such sites. Alternatively, the nucleic acid sequence of 
the present invention may be expressed by inserting the nucleic acid sequence or a nucleic 
acid construct comprising the sequence into an appropriate vector for expression. In creating 
the expression vector, the coding sequence is located in the vector so that the coding 
sequence is operably linked with the appropriate control sequences for expression, and 
possibly secretion. 

The recombinant expression vector may be any vector (e.g., a plasmid or virus) which 

can be conveniently subjected to recombinant DNA procedures and can bring about the 

expression of the nucleic acid sequence. The choice of the vector will typically depend on the 

compatibility of the vector with the host cell into which the vector is to be introduced. The 

vectors may be linear or closed circular plasmids. The vector may be an autonomously 

replicating vector, i.e., a vector which exists as an extrachromosomal entity, the replication of 
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which is independent of chromosomal replication, e.g., a plasmid, an extrachromosomal 
element, a minichromosome, or an artificial chromosome. The vector may contain any means 
for assuring self-replication. Alternatively, the vector may be one which, when introduced into 
the host cell, is integrated into the genome and replicated together with the chromosome(s) 
into which it has been integrated. The vector system may be a single vector or plasmid or two 
or more vectors or plasmids which together contain the total DNA to be introduced into the 
genome of the host cell, or a transposon. 

The vectors of the present invention preferably contain one or more selectable 
markers which permit easy selection of transformed cells. A selectable marker is a gene the 
product of which provides for biocide or viral resistance, resistance to heavy metals, 
prototrophy to auxotrophs, and the like. 

Antibiotic selectable markers confer antibiotic resistance to such antibiotics as 
ampicillin, kanamycin, chloramphenicol, tetracycline, neomycin, hygromycin or methotrexate. 
Suitable markers for yeast host cells are ADE2, HIS3, LEU2, LYS2, MET3. TRP1 , and URA3. 

The vectors of the present invention preferably contain an element(s) that permits 
stable integration of the vector, or of a smaller part of the vector, into the host cell genome or 
autonomous replication of the vector in the cell independent of the genome of the cell. 
v The vectors, or smaller parts of the vectors such as amplification units of the present 

invention, may be integrated into the host cell genome when introduced into a* host cell. For 
■chromosomal integration, the vector may rely on the nucleic acid sequence encoding the 
polypeptide or any other element of the vector for stable integration of the vector into the 
genome by homologous or nonhomologous recombination. 

Alternatively, the vector may contain additional nucleic acid sequences for directing 
integration by homologous recombination into the genome of the host cell. The additional 
nucleic acid sequences enable the vector to be integrated into the host cell genome at a 
precise location(s) jn the chromosome(s). To increase the likelihood of integration at a precise 
location, the integrational elements should preferably contain a sufficient number of nucleic 
acids, such as 100 to 1,500 base pairs, preferably 400 to 1,500 base pairs, and most 
preferably 800 to 1,500 base pairs, which are highly homologous with the corresponding target 
sequence to enhance the probability of homologous recombination. The integrational 
elements may be any sequence that is homologous with the target sequence in the genome of 
the host cell. Furthermore, the integrational elements may be non-encoding or encoding 
nucleic acid sequences; specific examples of encoding sequences suitable for site-specific 
integration by homologous recombination are given in WO 02/00907 (Novozymes, Denmark), 
which is hereby incorporated by reference in its totality. 

On the other hand, the vector may be integrated into the genome of the host cell by 
non-homologous recombination. These nucleic acid sequences may be any sequence that is 
homologous with a target sequence in the genome of the host cell, and, furthermore, may be 
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non-encoding or encoding sequences. The copy number of a vector, an expression cassette, 
an amplification unit, a gene or indeed any defined nucleotide sequence is the number of 
identical copies that are present in a host cell at any time. A gene or another defined 
chromosomal nucleotide sequence may be present in one, two, or more copies on the 
chromosome. An autonomously replicating vector may be present in one, or several hundred 
copies per host cell. 

An amplification unit of the invention is a nucleotide sequence that can integrate into 
the chromosome of a host cell, whereupon it can increase in number of chromosomally 
integrated copies by duplication of multiplication. The unit comprises an expression cassette 
as defined herein comprising at least one copy of a gene of interest and an expressable copy 
of a chromosomal gene, as defined herein, of the host cell. When the amplification unit is 
integrated into the chromosome of a host cell, it is defined as that particular region of the 
chromosome which is prone to being duplicated by homologous recombination between two 
directly repeated regions of DNA. The precise border of the amplification unit with respect to 
the flanking DNA is thus defined functionally, since the duplication process may indeed 
duplicate parts of the DNA which was introduced into the chromosome as well as parts of the 
endogenous chromosome itself, depending on the exact site of recombination within the 
repeated regions. This principle is illustrated in Janniere et al. (1985, Stable gene amplification 
it- in the chromosome of Bacillus subtilis. Gene, 40: 47-55), which is incorporated herein by 
reference. '* 

For autonomous replication, the vector may further comprise an origin of replication 
enabling the vector to replicate autonomously in the host cell in question. Examples of 
bacterial origins of replication are the origins of replication of plasmids pBR322, pUC19. 
PACYC177, pACYC184, pUB110, pE194, pTA1060, and pAMbetal. Examples of origin of 
replications for use in a yeast host cell are the 2 micron origin of replication, the combination of 
CEN6 and ARS4, and the combination of CEN3 and ARS1. The origin of replication may be 
one having a mutation which makes its functioning temperature-sensitive in the host cell (see, 
e.g., Ehrlich, 1978, Proceedings of the National Academy of Sciences USA 75:1433). 

The present invention also relates to recombinant host cells, comprising a nucleic acid 
sequence of the invention, which are advantageously used in the recombinant production of 
the polypeptides. The term "host ceir encompasses any progeny of a parent cell which is not 
identical to the parent cell due to mutations that occur during replication. 

The cell is preferably transformed with a vector comprising a nucleic acid sequence of 

the invention followed by integration of the vector into the host chromosome. "Transformation 0 

means introducing a vector comprising a nucleic acid sequence of the present invention into a 

host cell so that the vector is maintained as a chromosomal integrant or as a self-replicating 

extra-chromosomal vector. Integration is generally considered to be an advantage as the 

nucleic acid sequence is more likely to be stably maintained in the cell. Integration of the 
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vector into the host chromosome may occur by homologous or non-homologous recombination 
as described above. 

The transformation of a bacterial host cell may, for instance, be effected by protoplast 
transformation (see, e.g., Chang and Cohen, 1979, Molecular General Genetics 168:111-115), 
by using competent cells (see, e.g., Young and Spizizin, 1961, Journal of Bacteriology 81:823- 
829 or Dubnar and Davidoff-Abelson, 1971, Journal of Molecular Biology 56:209-221), by 
electroporation (see, e.g., Shigekawa and Dower, 1988, Biotechniques 6:742-751), or by 
conjugation (see, e.g., Koehler and Thorne, 1987, Journal of Bacteriology 169:5771-5278). 

The transformed or transfected host cells described above are cultured in a suitable 
nutrient medium under conditions permitting the expression of the desired polypeptide, after 
which the resulting polypeptide is recovered from the cells, or the culture broth. 

The medium used to culture the cells may be any conventional medium suitable for 
growing the host cells, such as minimal or complex media containing appropriate supplements. 
Suitable media are available from commercial suppliers or may be prepared according to 
published recipes (e.g. in catalogues of the American Type Culture Collection). The media are 
prepared using procedures known in the art (see, e.g., references for bacteria and yeast; 
Bennett, J.W. and LaSure, L., editors, More Gene Manipulations in Fungi, Academic Press, 
CA, 1991). 

-The polypeptide are recovered from the culture medium by conventional procedures 
including separating the host cells from the medium by centrifugation or filtration, precipitating 
the proteinaceous components of the supernatant or filtrate by means of a salt, e.g. 
ammonium sulphate, purification by a variety of chromatographic procedures, e.g. ion 
exchange chromatography, gelfiltration chromatography, affinity chromatography, or the like, 
dependent on the type of polypeptide in question. 

The polypeptides may be detected using methods known in the art that are specific for 
the polypeptides. These detection methods may include use of specific antibodies, formation 
of an enzyme product, or disappearance of an enzyme substrate. For example, an enzyme 
assay may be used to determine the activity of the polypeptide. 

The polypeptides of the present invention may be purified by a variety of procedures 
known in the art including, but not limited to, chromatography (e.g., ion exchange, affinity, 
hydrophobic, chromatbfocusing, and size exclusion), electrophoretic procedures (e.g., 
preparative isoelectric focusing (IEF), differential solubility (e.g., ammonium sulfate 
precipitation), or extraction (see, e.g., Protein Purification, J.-C. Janson and Lars Ryden, 
editors, VCH Publishers, New York, 1989). 

In the present context, the term "substantially pure polypeptide" means a polypeptide 
preparation which contains at the most 10% by weight of other polypeptide material with which 
it is natively associated (lower percentages of other polypeptide material are preferred, e.g. at 

the most 8% by weight, at the most 6% by weight, at the most 5% by weight, at the most 4% at 
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the most 3% by weight, at the most 2% by weight, at the most 1% by weight, and at the most 
1 / 2 % by weight). Thus, it is preferred that the substantially pure polypeptide is at least 92% 
pure, i.e. that the polypeptide constitutes at least 92% by weight of the total polypeptide 
material present in the preparation, and higher percentages are preferred such as at least 94% 
pure, at least 95% pure, at least 96% pure, at least 96% pure, at least 97% pure, at least 98% 
pure, at least 99%, and at the most 99.5% pure. The polypeptides disclosed herein are 
preferably in a substantially pure form. In particular, it is preferred that the polypeptides 
disclosed herein are in "essentially pure form", i.e. that the polypeptide preparation is 
essentially free of other polypeptide material with which it is natively associated. This can be 
accomplished, for example, by preparing the polypeptide by means of well-known recombinant 
methods. Herein, the term "substantially pure polypeptide 0 is synonymous with the terms 
"isolated polypeptide" and "polypeptide in isolated form". 

In the present context, the homology between two amino acid sequences or between 
two nucleotide sequences is described by the parameter "identity". For purposes of the present 
invention, alignments of sequences and calculation of homology scores may be done using a 
full Smith-Waterman alignment, useful for both protein and DNA alignments. The default 
scoring matrices BLOSUM50 and the identity matrix are used for protein and DNA alignments 
respectively. The penalty for the first residue in a gap is -12 for proteins and -16 for DNA, while 
the penalty for additional residues in a gap-is -2 for proteins and -4 for DNA. Alignment may be 
made with the FASTA package version v20u6 (W. R. Pearson and D..J. Lipman (1988), 
"Improved Tools for Biological Sequence Analysis", PNAS 85:2444-2448, and W. R. Pearson 
(1990) "Rapid and Sensitive Sequence Comparison with FASTP and FASTA", Methods in 
Enzymology, 183:63-98). 

Multiple alignments of protein sequences may be made using "ClustalW" (Thompson, 
J.D., Higgins. D.G. and Gibson, T.J. (1994) CLUSTAL W: improving the sensitivity of 
progressive multiple sequence alignment through sequence weighting, positions-specific gap 
penalties and weight matrix choice. Nucleic Acids Research, 22:4673-4680). Multiple 
alignment of DNA sequences may be done using the protein alignment as a template, 
replacing the amino acids with the corresponding codon from the DNA sequence. 

In the present context, a functional homologue of the YusZ or YusX protein is a 
protein, which when expressed at a reduced level in a cell, leads to an increased secretion of a 
heterologous polypeptide, preferably an enzyme such as an alpha-amylase, when compared 
with an isogenic cell having a normal expression of the YusZ or YusX functional homologue, 
where both are cultivated under essentially identical conditions. In addition, the functional 
homologue of the YusZ or YusX protein shares an amino acid sequence identity with the 
respective YusZ or Yus X protein of at least 50%, preferably 55%, 60%, 65%, 70%, 75%, 80%, 
85%, 90%, 95%, 97%, or most preferably 99% when aligned as described above. 
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In the present context, the term "allelic variant 0 denotes any of two or more alternative 
forms of a gene occupying the same chromosomal locus. Allelic variation arises naturally 
through mutation, and may result in polymorphism within populations. Gene mutations can be 
silent (no change in the encoded polypeptide) or may encode polypeptides having altered 
amino acid sequences. An allelic variant of a polypeptide is a polypeptide encoded by an 
allelic variant of a gene. Allelic variants are included in the present definition of functional 
homologues. 

The YusZ or YusX protein or functional homologue thereof may be a wild-type protein 
identified and isolated from a natural source. Such wild-type proteins may be specifically 
screened for by standard techniques known in the art. Furthermore, genes encoding the YusZ 
or YusX protein, or a functional homologue thereof, may be prepared by the DNA shuffling 
technique, such as described in J.E. Ness et a). Nature Biotechnology 17, 893-896 (1999). 
Moreover, the YusZ or YusX protein, or functional homologue thereof, may be an artificial 
variant. Such artificial variants may be constructed by standard techniques known in the art, 
such as by site-directed/random mutagenesis. In one embodiment of the invention, amino acid 
changes (in the artificial variant as well as in wild-type polypeptides) are of a minor nature, that 
is conservative amino acid substitutions that do not significantly affect the folding and/or 
. activity of the protein; small deletions, typically of one to about 30 amino acids; small amino- or 
• . carboxyl-terminal extensions, such as an amino^erminal methionine residue; a small linker: 
: peptide of up to about 20-25 residues; or a small extension that facilitates purification by 
changing net charge or another function, such as a poly-histidine tract, an antigenic epitope or 
a binding domain. 

Examples of conservative substitutions are within the group of basic amino acids 
(arginine, lysine and histidine), acidic amino acids (glutamic acid and aspartic acid), polar 
amino acids (glutamine and asparagine), hydrophobic amino acids (leucine, isoleucine, valine 
and methionine), aromatic amino acids (phenylalanine, tryptophan and tyrosine), and small 
amino acids (glycine, alanine, serine and threonine). Amino acid substitutions which do not 
generally alter the specific activity are known in the art and are described, for example, by H. 
Neurath and R.L. Hill, 1979, In, The Proteins, Academic Press, New York. The most commonly 
occurring exchanges are Ala/Ser, Val/lle, Asp/Glu, Thr/Ser, Ala/Gly, AiaH*hr, Ser/Asn, AlaA/al, 
Ser/Gly, Tyr/Phe, Ala/Pro, Lys/Arg, Asp/Asn, Leu/lle t LeuA/al, Ala/Glu, and Asp/Gly as well as 
these in reverse. 

It will be apparent to those skilled in the art that such modifications can be made 
outside the regions critical to the function of the molecule and still result in an active 
polypeptide. Amino acid residues essential to the activity of the polypeptide encoded by the 
nucleotide sequence of the invention, and therefore preferably not subject to modification, such 
as substitution, may be identified according to procedures known in the art, such as site- 
directed mutagenesis or alanine-scanning mutagenesis (see, e.g., Cunningham and Wells, 
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1989, Science 244: 1081-1085). In the latter technique, mutations are introduced at every 
positively charged residue in the molecule, and the resultant mutant molecufes are tested for 
activity to identify amino acid residues that are critical to the activity of the molecule. Sites of 
substrate-enzyme interaction can also be determined by analysis of the three-dimensional 
structure as determined by such techniques as nuclear magnetic resonance analysis, 
crystallography or photoaffinity labelling (see, e.g.. de Vos et al., 1992, Science 255: 306-312; 
Smith et al., 1992, Journal of Molecular Biology 224: 899-904; Wlodaver et al„ 1992, FEBS 
Letters 309: 59-64). 

Moreover, a nucleotide sequence encoding a polypeptide of the present invention 
may be modified by introduction of nucleotide substitutions which do not give rise to another 
amino acid sequence of the polypeptide encoded by the nucleotide sequence, but which 
correspond to the codon usage of the host organism intended for production of the enzyme. 

The introduction of a mutation into the nucleotide sequence to exchange one 
nucleotide for another nucleotide may be accomplished by site-directed mutagenesis using 
any of the methods known in the art. Particularly useful is the procedure, which utilizes a 
supercoiled, double stranded DNA vector with an insert of Interest and two synthetic primers 
containing the desired mutation. The oligonucleotide primers, each complementary to opposite 
strands of the vector, extend during temperature cycling by means of Pfu DNA polymerase. 
• On incorporation of the primers, a mutated^plasmid containing staggered nicks is- generated. • 
Following temperature cycling, the product is treatedtwith Dpnl which is specific for methylated 
and hemimethylated DNA to digest the parental DNA template and to select for mutation- 
containing synthesized DNA. Other procedures known in the art may also be used. For a 
general description of nucleotide substitution, see, e.g., Ford et al., 1991. Protein Expression 
and Purification 2: 95-107. 

SUMMARY OF THE INVENTION 

The yusZ DNA sequence is shown in SEQ ID NO: 1, the putative encoded amino acid 
sequence is shown in SEQ ID NO: 2, the yusX DNA sequence is shown in SEQ ID NO: 3, and 
the putative encoded amino acid sequence is shown in SEQ ID NO: 4; the yusY DNA 
sequence is shown in SEQ ID NO: 5, and the putative encoded amino acid sequence is 
shown in SEQ ID NO: 6. 

A problem to be solved is how to provide increased secretion of heterologous 
polypeptides produced in prokaryotic cells. The present invention provides mutated prokaryotic 
cells which have a reduced expression-level of YusZ (SEQ ID NO: 2), YusX <SEQ ID NO: 4), 
or homologues thereof, and which secrete higher amounts of at least one heterologous 
polypeptide of interest, when compared with respective corresponding otherwise isogenic but 
non-mutated cells. Typically, a mutated cell of the invention is compared under identical growth 
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conditions with the non-mutated parent cell from which the mutant was derived; the parent cell 
will be completely isogenic with the mutated cell, except for the mutations leading to the 
reduced YusZ or YusX expression-levels. The inventors have found that a reduced 
expression-level of YusZ or YusX in a prokaryotic host cell leads to a higher yield of secreted 
heterologous polypeptides. This result is highly interesting for the industrial production of 
secreted polypeptides such as enzymes. 

Accordingly, in a first aspect the invention relates to a mutated prokaryotic cell, which 
has a reduced expression-level of YusZ (SEQ ID NO: 2), YusX (SEQ ID NO: 4), or 
homologues thereof, and which secretes higher amounts of at least one heterologous 
polypeptide of interest, when compared with an otherwise isogenic but non-mutated cell. 

In a second aspect the invention relates to a method for constructing a mutated 
prokaryotic cell, said method comprising the steps of: 

a) mutating a prokaryotic cell; and 

b) selecting a mutated cell which has a reduced expression-level of YusZ (SEQ ID NO: 
2) or YusX (SEQ ID NO: 4), or homologues thereof, and which secretes higher 
amounts of at least one heterologous polypeptide of interest, when compared with an 
otherwise isogenic but non-mutated cell. 

A final aspect of the invention relates to a method for producing a polypeptide of 
interest/said method comprising the steps of: - * 1 

a) cultivating a mutated prokaryotic cell, .which has a reduced expression-level of YusZ". 
(SEQ ID NO: 2), YusX (SEQ ID NO: 4), or homologues thereof, and which secretes 
higher amounts of the polypeptide of interest, when compared with an otherwise 
isogenic but non-mutated cell; and 

b) isolating the polypeptide of interest. 

DETAILED DESCRIPTION 

The first aspect of the invention relates to a mutated prokaryotic cell, which has a 
reduced expression-level of YusZ (SEQ ID NO: 2), YusX (SEQ ID NO: 4), or homologues 
thereof, and which secretes higher amounts of at least one heterologous polypeptide of 
interest, when compared with an otherwise isogenic but non-mutated cell. 

A preferred embodiment of the invention relates to a cell of of the first aspect, which is 
a Gram-positive cell, preferably a Bacillus cell, more preferably a B.atkatophilus, 
B.amyloliquefaciens, B.brevis, B.circulans, B.clausii, B.coagulans, BJautus, B.lentus, 
BJicheniformis, B.megaterium, B.stearothermophilus, B.subtilis, or B.thuringiensis cell; or to 
methods of the second or third aspects, wherein the cell is as listed here. 

An evolutionary homologue of the YusZ or YusX protein, an allelltc variant, an artificial 
variant, a shuffled protein, a species variant, and so forth, are all referred to as a "functional 

homologue" or the YusZ or YusX protein in the present description, and the inventors envision 
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that reduced expression of such functional homologue protein(s) will be equally effective in the 

cell and methods of the invention. 

Specifically, a preferred embodiment relates to the cell, wherein the YusZ or YusX 
protein or functional homologue thereof comprises an amino acid sequence which is at least 
70% identical to the amino sequence shown in SEQ ID NO:2 or 5EQ ID NO: 4, respectively; 
preferably at least 75%, 80%, 85%, 90%, 95%, 97%, or even 99% identical to the amino 
sequence shown in SEQ ID NO:2 or SEQ ID NO: 4, respectively. 

Another preferred embodiment relates to the cell of the invention, or the methods of 
the invention, wherein the YusZ or YusX protein or functional homologue thereof comprises or 
consists of the amino acid sequence shown in SEQ ID NO:2 or SEQ ID NO:4, respectively. 

Due to the organisation of the yusX and yusY genes in an operon, reduced 
expression of YusX may be achieved by mutating the encoding gene, or by mutating the 
immediately upstream open reading frame of the operon, yusY. 

Accordingly, in a preferred embodiment of the invention, the reduced expression of 
YusZ or YusX or homologues thereof is achieved by mutating one .or more respective 
encoding gene, a cell of the invention is preferably mutated in yusZ (SEQ ID NO: 1), yusX 
(SEQ ID NO: 3), and/or yusY (SEQ ID NO: 5), or homologues thereof; and preferably the yusZ, 
yusX, and/or yusY homologues encode a polypeptide having an amino acid sequence at least 
• 70% identical to the sequence shown in SEQ ID NO: 2, SEQ ID NO: 4/ or SEQ ID NO: 6, ir- 
respectively; or preferably at .least 75%, 80%, 85%, 90%, 95%, 97%, or even 99% identical to * : 
the amino sequence shown in SEQ ID NO:2, SEQ ID NO: 4, or SEQ ID NO: 6, respectively; 
most preferably, the yusZ, yusX, and/or yusY homologues have a nucleotide sequence at least 
70% identical to the sequence shown in SEQ ID NO: 1. SEQ ID NO: 3, or SEQ ID NO: 5, 
respectively; or preferably at least 75%, 80%, 85%, 90%. 95%, 97%, or even 99% identical to 
the sequence shown in SEQ ID NO: 1, SEQ ID NO: 3, or SEQ ID NO: 5, respectively. 

As mentioned elsewhere herein, one way of identifying functional yusZ, yusX, or YusY 
genes in a cell is by hybridization. Accordingly, a preferred embodiment relates to a cell of the 
first aspect, or methods of the second or third aspects wherein the cell is mutated in at least 
one polynucleotide, where a subsequence having a size of at least 100 bp of the at least one 
polynucleotide hybridizes with a polynucleotide having the sequence shown in SEQ ID NO: 1, 
SEQ ID NO: 3, or SEQ ID NO: 5. or the respective complementary sequences, under very low 
to very high stringency conditions, preferably very low, low, medium, medium-high, high, or 
very high stringency hybridization conditions. 

The cell of the present invention may be mutated in any suitable manner and 
procedures for performing such mutagenesis are very well-known in the art. 

A preferred embodiment of the invention relates to a cell of the first aspect, in which 
yusZ, yusX, and/or yusY, or homologues thereof, is/are partially or fully deleted from the 
chromosome. 
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Another preferred embodiment relates to a cell, in which yusZ, yusX, and/or yusY, or 
homologues thereof, comprise at least one frameshift mutation or non-sense mutation. 

The mutated cell of the invention has a reduced expression-level of YusZ or YusX 
protein or a funtional homologue thereof, than an otherwise isogenic but non-mutated cell, e.g. 
a parent cell. A comparison should be made by cultivating the cell of the invention as well as 
the isogenic but non-mutated cell under essentially identical conditions, and comparing the 
amount of YusZ or YusX protein by any standard method in the art. Preferably the cell of the 
invention produces at least 5% less YusZ or YusX than the non-mutated cell, more preferably 
at least 10%,'still more preferably at least 20%, and most preferably at least 50% less YusZ or 
YusX protein or a functional homologue thereof. 

In a preferred embodiment, the cell of the invention has at least a two-fold reduced 
expression-level of YusZ or YusX, or homologues thereof, when compared with the otherwise 
isogenic but non-mutated cell; preferably the cell has no measureable expression of YusZ or 
YusX, or homologues thereof, when compared with the otherwise isogenic but non-mutated 
cell. 

As the inventors show herein, a cell of the first aspect is capable of secreting greater 
amounts of a heterologous polypeptide of interest than the corresponding isogenic but non- 
mutated cell, when both are cultivated. under essentially identical conditions. :t 
. • -Accordingly, a preferred embodiment of the invention relates to the cell of the .first 
aspect, which secretes greater, amounts of a heterologous polypeptide of interest thanran 
otherwise isogenic but non-mutated cell. Preferably the cell of the invention secretes at least 
5% more, more preferably at least 10% more, still more preferably at least 20% more, and 
most preferably at least 50% more than the non-mutated cell. The amount of secreted 
heterologous polypeptide from the cell may be determined by any suitable assay in the art; a 
non-limiting example is shown betow with the determination of secreted amounts of alpha- 
amylase. 

In a preferred embodiment of the invention, the at least one heterologous polypeptide 
comprises an enzyme, preferably the enzyme is a lyase, a ligase, a hydrolase, an 
oxidoreductase, a transferase, or an isomerase. 

Methods of stably integrating one or more copies of polynucleotides encoding 
heterologous polypeptides into the chromosome of prokaryotic cells are well described in the 
art, for instance in WO 94/14968, WO 99/41358, WO 91/09129, WO 02/00907, or WO 
01/90393, which are all incorporated herein by reference in their entirety. 

Accordingly, in a preferred embodiment of the invention, the cell comprises one or 
more chromosomally integrated copies of a polynucleotide encoding the at least one 
heterologous polypeptide. 

The skilled person is well aware that increased expression of the polynucleotide 

encoding the polypeptide of interest is advantageous in the industrial production of 
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polypeptides, and it is common knowledge in the art that increasing promoter strength is one 
way of achieving increased expression, see WO 99/43835, WO 93/10249, WO 98/07846, or 
WO 03/008575, which are incorporated herein by reference in their entirety. 

A preferred embodiment relates to the cell of the invention, wherein the at least one 
heterologous polypeptide is encoded by a polynucleotide which is transcribed from at least one 
heterologous promoter, preferably the at least one promoter comprises an artificial promoter, 
and more preferably the artificial promoter comprises one or more mRNA-stabilizing sequence, 
preferably derived from the c/yllla promoter. 

EXAMPLES 

Materials and methods 

Strains 

B.subtilis 168. F. Kunst et al. "The complete genome sequence of the Gram-positive 
bacterium Bacillus subtilis" Nature (1 997) 390:249-256. 

B.subtilis AN83. This strain is the B.subtilis 168 with the plasmid pKTH10 which constitutively 
expresses an amylase in high amounts. 

B.subtilis AN133. This strain is the B.subtilis 168 strain wherein the yusZgene is deleted. 
• ■ * ' B.subtilis AN1 37." This strain is AN133 with the plasmid- pKTH10 which constitutively-expresses 
■an amylase in high amounts. -- v - 
B.subtilis AN151. This strain is the B.subtilis 168 strain wherein the yusX gene is deleted 
B.subtilis AN155.This strain is AN151 with the plasmid pKTH10 which constitutively expresses 
an amylase in high amounts. 

Primers: 

yusZIF (SEQ ID NO: 7): ccttcccggggctaagcttttcggc 
yusZ2R (SEQ ID NO: 8): gatagactcccacgcgctggacgctcctgt 
yusZ2F (SEQ ID NO: 9): acaggagcgtccagcgcgtgggagtctatc 
yusZ3R (SEQ ID NO: 10):aacggtaccctgaccaagcagacag 
yusXIF (SEQ ID NO: 11):aatgcccgggcaagctttacagctg 
^ yusX2R (SEQ ID NO: 12): ggcgtcacgcacagcaacgagcgacgattg 
yusX2F (SEQ ID NO: 13): caatcgtcgctcgttgctgtgcgtgacgcc 
yusX3R (SEQ ID NO: 14): aatcggtaccatcataatgactgtc 

Piasmids 
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pKTHIO: Vehmaanpera J, Steinborn G, Hofemeister J.: n Genetic manipulation of 
Bacillus amyloliquefaciens." J Biotechnol. i991 Jul;19(2-3):221-40. This plasmid constitutively 
express the B. amyloliquefaciens alpha-amylase (AmyQ). 

pSJ6410: a derivative of plasmid pSJ2739 (described in U.S.Patent 6,100,063), which 
is again derived from pE194, naturally temperature-sensitive for replication. pSJ6410 consists 
of the pE194 replicon, as well as a fragment derived from plasmid pUB110 and a Bacillus 
licheniformis alpha-amylase gene preceded by a fragment from the Bacillus thuringiensis 
crylllA regulatory region. These additional fragmens are irrelevant for the use of pSJ6410 as a 
vector in the present invention. 

pAN28: constructed by ligating the PCR product yi/szSOEpcr (SEQ ID NO: 15), cut 
with restriction enzymes Xmal and Kpn\, to the large Xma\-Kpn\ fragment of pSJ6410. This 
plasmid,. which contains the temperature sensitive origin of pE194, was used for deletion of the 
yusZ gene from the chromosome of B. subtilis 168 by a double cross-over event. The PCR 
product yuszSOEpcr was generated by use of the technique of splicing by overlap extension 
by the polymerase chain reaction (SOE by PCR, Horton RM etal. Biotechniques. 1990 
May;8(5):528-35). Two intermediate PCR products, PCR1 and PCR2, each tipped with a small 
sequence of the other, were mixed in a second-stage PCR to produce the final spliced product, 
yuszSOEpcf; PCR1, generated by use of primers yusZIF and yusZ2R, contains yusZ 
upstream sequence (655bp). PCR2, generated by use of primers yusZ2F and yusZ3R, 
contains yusZ downstream sequence (690bp). Chromosomal DNA from B. subtilis 168 was 
used as a template for PCR. The spliced product (1315bp) f wherein the yusZ gene is reduced 
from 280aa to 25aa. was generated in the second-stage PCR using PCR1 and PCR2 as 
templates, and yusZIF and yusZ3R as primers. The entire nucleotide sequence of plasmid 
pAN28 is shown in SEQ ID NO: 16. 

pAN23: constructed by ligating the PCR product yusxSOEpcr (SEQ ID NO: 17), cut 

with restriction enzymes Xmal and Kpn\ t to the large Xma\-Kpn\ fragment of pSJ6410. This 

plasmid, which contains the temperature sensitive origin of pE1 94, was used for deletion of the 

yusX gene from the chromosome of S. subtilis 168 by a double cross-over event. The PCR 

product yusxSOEpcr was generated by use of the technique of splicing by overlap extension 

by the polymerase chain reaction (SOE by PCR, Horton RM etal. Biotechniques. 1990 

May;8(5):528-35). Two intermediate PCR products, PCR1 and PCR2, each tipped with a small 

sequence of the other, were mixed in a second-stage PCR to produce the final spliced product, 

yusxSOEpcr. PCR1, generated by use of primers yusXIF and yusX2R, contains yusX 

upstream sequence (560bp). PCR2, generated by use of primers yusX2F and yusX3R, 

contains yusX downstream sequence (560bp). Chromosomal DNA from B. subtilis 168 was 

used as a template for the PCR. The spliced product (1090bp), wherein the yusX gene is 

reduced from 500aa to 27aa, was generated in the second-stage PCR using PCR1 and PCR2 
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as templates, and yusXIF and yusX3R as primers. The entire sequence of plasmid pAN23 is 
shown in SEQ ID NO: 18. 

General molecular biology methods 

Unless otherwise mentioned, the DNA manipulations and transformations were 
performed using standard methods of molecular biology (Sambrook et al. (1989) Molecular 
cloning; A laboratory manual, Cold Spring Harbor lab., Cold Spring Harbor, NY; Ausubel, F. M. 
et al. (eds.) "Current protocols in Molecular Biology". John Wiley and Sons, 1995; Harwood, C. 
R., and Cutting, S. M. (eds.) "Molecular Biological Methods for Bacillus". John Wiley and Sons, 
1 990). 

Enzymes for DNA manipulations were used according to the specifications of the 
suppliers (e.g. restriction endonucleases, ligases etc. are obtainable from New England 
Biolabs, Inc.). 

Competent cells were prepared and transformed as described by Yasbin, R.E., Wilson, 
GA and Young, F.E. (1975) Transformation and transfection in lysogenic strains of Bacillus 
subtiiis : evidence for selective induction of prophage in competent cells. J. Bacteriol, 121:296- 
304. 

Media 

' ' ' 

LB agar: as described in Ausubel, F. M. et al. (eds.) "Current protocols in Molecular Biology". 
John Wiley and Sons, 1995). 

LBP: LB agar supplemented with 0.05 M potassium phosphate, pH 7.0 

LBPG: is LB agar supplemented with 0.5% Glucose and 0.05 M potassium phosphate, pH 7.0. 
LBPSK: is LB agar supplemented with 0.05 M potassium phosphate, pH 7.0 and 1% of 
I skimmed milk. 

BPX: as described in EP 0 506 780 (WO 91/09129). 

■ 

« 

Fermentations 

Fermentations to evaluate the amylase yields were performed in shakeflasks with 100 ml 
BPX at 37°C, 300 rpm for seven days. Culture volumes of 10 ml were harvested and centrifuged 
at 10.000 g to remove cells and debris. The clear supernants were used for assaying alpha- 
amylase activity. 

Assay for alpha-amvlase activity 

Alpha-amylase activity was determined by a method employing an enzymatic 
colorimetric test with 4,6-ethylidene(G 7 )-p-nitrophenyl(G 1 )-a ) D-maltoheptaoside (ethylidene- 
G 7 PNP) as substrate (Boehringer Mannheim, Germany art. 1442309). Under a specified set of 
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conditions (temp., pH, reaction time, buffer conditions) 1 mg of a given a-amylase will 
hydrolyse a certain amount of substrate and a yellow colour will be produced. The colour 
intensity is measured at 405 nm. The measured absorbance is directly proportional to the 
activity the a-amylase in question under a given set of conditions. 

Example 1. Construction of a yusZ-deletion mutant 

The 1315 bp yuszSOEpcr DNA fragment, which contains an in-frame 255 aa deletion 
of the yusZ gene, was generated by SOE by PCR, and cloned in a plasmid with a temperature 
sensitive origin (pSJ6410). resulting in plasmid pAN28. pAN28 was introduced by 
transformation into the B. subtiiis 168 strain and plated at 45°C (non-permissive temperature) 
on LBPG media supplemented with 1 micro-g/ml erm to select for integrants. Transformants on 
these plates have integrated the plasmid in the yusZ-upstream or yusZ-downstream locus by a 
single (erm*) cross-over event. Excision of the plasmid is by either of two ways, which in one 
case will result in the wildtype strain, and in the other case will result in a strain with the yusZ 
gene substituted by the yuszSOEpcr (AyusZ). 

To allow for excision, selection and identification of strains deleted for yusZ, integrants 
were inoculated in 10 ml of LB and grown overnight at 30°C (permissive temperature). 100 
micro-l of outgrown culture of integrants were transferred to 10 ml of LB and grown at 30°C for 
another overnight. Cells were plated on LBPG at 30°C and strains where double cross-over 
events occurred (integration-excision) were scored as erm" by replica plating. 

PCR with primers yusZIF and yusZ3R was performed on strains to determine 
presence of either a wildtype (2155 bp) or a deleted (1315 bp) yusZ gene in strains where 
double cross-over events had occurred. A yusZ-deleted strain was isolated and named AN133, 
and the deletion was verified by a comprehensive sequence analysis which covered the whole 
yuszSOEpcr region (primers yusZIF and yusZ3R). 

Example 2. Amylase yield from the yz/sZ-deletion mutant 

AN133 was transformed with plasmid pKTH10 which constitutively expresses the 
alpha-amylase AmyQ from Bacillus amytoliquefaciens, The resulting strain was named AN 137. 
The yield of amylase from AN137 was determined in triplicate from two independent isolates 
and compared to the yield of amylase from the control strain AN83. The AN137 strain (AyusZ) 
had an increased alpha-amylase yield, which on average is 205% higher than the control strain 
AIM83, which carries the wild-type yusZ gene. Results are shown in table 1. 

Table 1 . Yields of amylase from the AN137 strain {AyusZ), and the control strain AN83. 



Strain 


Amylase activity 


Average 


Average 




(relative) 


yields 


yields 
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AN1 37-1.1 


21,5 






AN137-1.2 


20,3 




205% 


AN137-1.3 


18,1 


19.3 


AN 137-2.1 


17.4 






AN 137-2.2 


20,6 






AN1 37-2.3 


17,8 






AN83-1.1 


9.11 






AN83-1.2 


7.52 




100% 


AN83-1.3 


6.76 


9.4 


AN83-2.1 


10,5 






AN83-2.2 


10,5 






AN83-2.3 


11.9 







Example 3. Construction of a yusX-deletion mutant 

The 1090 bp yusxSOEpcr DNA fragment, which contains an in-frame 473 aa deletion 
of the yusXgene, was generated by SOE by PCR, and cloned in a plasmid with a temperature 
sensitive origin (pSJ6410), resulting in plasmid pAN23, as described above. pAN23 was 
"introduced by transformation' into the B. subtilis 168 strain and plated at 45°C (non-permissive 
temperature) on LBPG media supplemented with 1 micro-g/ml erm to select for integrants.' 
Transformants on these plates have integrated the plasmid in the yusX-upstream or yusX- 
downstream locus by a single (erm*) cross-over event. Excision of the plasmid is by either of 
two ways, which in one case will result in the wildtype strain, and in the other case will result in 
a strain with the yusX gene substituted by the yusxSOEpcr (AyusX). 

To allow for excision, selection and identification of strains deleted for yusX, integrants 
were inoculated in 10 ml of LB and grown overnight at 30°C (permissive temperature). 100 
microliter of outgrown culture of integrants were transferred to 10 ml of LB and grown at 30°C 
for another overnight. Cells were plated on LBPG at 30°C and strains where double cross-over 
events occurred (integration-excision) were scored as erm" by replica plating. PCR with 
primers yusXIF and yusX3R were performed on strains to determine presence of wt- (2539bp 
PCR-product) or truncated- (1090-bp PCR-product) yusXgene in strains where double cross- 
over events occurred. A yusX-deleted strain was named AN151 and verified by a 
comprehensive sequence analysis which covered the whole yusxSOEpcr region (primers 
yusXIFand yusX3R). 

Example 4. Amylase yield from the yusX-detetlon mutant 
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The yusX-deletion mutant AN151 was transformed with plasmid pKTHIO which 
constitutively expresses the alpha-amylase AmyQ of Bacillus amyloliquefaciens. The resulting 
strain was named AN155. The yields of amylase from AN155 were determined in duplicate 
from two independent isolates and compared to yield of amylase from the control strain AN83. 
Results are shown in table 2; the AN 155 strain which carries a yusX deletion, has an increased 
alpha-amylase yield, which on average is 239% higher than the control strain, AN83, which 
carries the wild-type copy of the yusXgene. 

Table 2. Yields of amylase from the AN137 strain (AyusX), and the control strain AN83. 



Strain 


Amylase activity 


Average 


Average 




(relative) 


yields 


yields 


AN1 55-1.1 


29,9 






AN 155-1 .2 


32,2 


29,8 


238% 


AN155-1.3 


30,2 


AN1 55-2.1 


28,1 






AN 155-2.2 


28,9 






AN1 55-2.3 


29,5 






AN83-1.1 


12,4 






AN83-1.2 


12.9 ' 


12,5 


100% 


AN83-1 .3 


11,5 


AN83-2.1 


12,3 






AN83-2.2 


11.9 






AN83-2.3 


13.8 
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CLAIMS 

1. A mutated prokaryotic cell, which has a reduced expression-level of YusZ (SEQ ID 
NO: 2), YusX (SEQ ID NO: 4), or homologues thereof, and which secretes higher amounts of 
at least one heterologous polypeptide of interest, when compared with an otherwise isogenic 

i 

but non-mutated cell. 

2. The cell according to claim 1, which is a Gram-positive cell. 

3. The cell according to claim 2, which is a Bacillus cell. 

i 
i 

4. The cell according to claim 3, which is a B.alkalophilus, B.amyloliquefaciens, B.brevis, 
B.circulans, B.clausii, B.coagulans, BJautus, B.lentus, B.licheniformis, B.megaterium, 
B.stearothermophilus, B.subtilis, or B.thuringiensis cell. 

i 5. The cell according to any of claims 1 - 4, wherein the YusZ or YusX homologues 

comprise an amino acid sequence at least 70% identical to the sequence shown in SEQ ID 
NO: 2 or SEQ ID NO: 4, respectively. 

6. The cell according to any of claims 1 - 5, which is mutated in yusZ (SEQ ID NO: 1), 
yusX (SEQ ID NO: 3), and/or yusY (SEQ ID NO: 5), or homologues thereof 

7. The cell according to claim 6, wherein the yu$Z t yusX, and/or yusY homologues 
encode a polypeptide having an amino acid sequence at least 70% identical to the sequence 

j shown in SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID NO: 6, respectively. 

■ 

i 

t 

• 8. The cell according to claim 6, wherein the yusZ, yusX t and/or yusY homologues have 

■ 

| a nucleotide sequence at least 70% identical to the sequence shown in SEQ ID NO: 1, SEQ ID 

NO: 3, or SEQ ID NO: 5, respectively. 

9. The cell according to any of claims 1-8, which is mutated in at least one 
polynucleotide, where a subsequence having a size of at least 100 bp of the at least one 
polynucleotide hybridizes with a polynucleotide having the sequence shown in SEQ ID NO: 1, 
SEQ ID NO: 3, or SEQ ID NO: 5, or the respective complementary sequences, under medium 
stringency hybridization conditions. 

10. The cell according to any of claims 6 - 9, in which yusZ, yusX, and/or yusY, or 
homologues thereof, is/are partially or fully deleted from the chromosome. 

i 
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11. The cell according to any of claims 6 - 9, in which yusZ, yusX, and/or yusY, or 
homologues thereof, comprise at least one frameshift mutation or non-sense mutation. 

12. The cell according to any of claims 1-11, which has at least a two-fold reduced 
expression-level of YusZ or YusX, or homologues thereof, when compared with the otherwise 
isogenic but non-mutated cell. 

13. The cell according to any of claims 1-12, which has no measureable expression of 
YusZ or YusX, or homologues thereof, when compared with the otherwise isogenic but non- 
mutated cell. 

14. The cell according to any of claims 1-13, wherein the at least one heterologous 
polypeptide comprises an enzyme. 

15. The cell according to claim 14, wherein the enzyme is a lyase, a ligase, a hydrolase, 
an oxidoreductase, a transferase, or an isomerase. 

16. The cell according to any of claims 1-15, which comprises one or more 
chromosomally integrated copies of a polynucleotide encoding the at least one heterologous 
polypeptide. . .\ 

17. The cell according to any of claims 1-16, wherein the at least one heterologous 
polypeptide is encoded by a polynucleotide which is transcribed from at least one heterologous 
promoter. 

18. The cell according to claim 17, wherein the at least one promoter comprises an 
artificial promoter. 

19. The cell according to claim 18, wherein the artificial promoter comprises one or more 
mRNA-stabilizing sequence, preferably derived from the cryllla promoter. 

20. A method for constructing a mutated prokaryotic cell, said method comprising the 
steps of: 

a) mutating a prokaryotic cell; and 

b) selecting a mutated cell which has a reduced expression-level of YusZ (SEQ ID NO: 
2) or YusX (SEQ ID NO: 4), or homologues thereof, and which secretes higher 

• amounts of at least one heterologous polypeptide of interest, when compared with an 

otherwise isogenic but non-mutated cell. 
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21 . The method according to claim 20, wherein the cell is a Gram-positive cell. 

22. The method according to claim 21, wherein the cell is a Bacillus cell. 

23. The method according to claim 22, wherein the cell is a B.alkalophilus, 
B.amyloliquefaciens, B.breWs, B.circulans, B.clausii, B.coagulans, B.lautus, B.lentus, 
B.ficheniformis, B.megaterium, B.stearothermophilus, B.subfflis, or B.thuringiensis cell, 

24. The method according to any of claims 20 - 23, wherein the YusZ or YusX 
homologies comprise an amino acid sequence at least 70% identical to the sequence shown 
in SEQ ID NO: 2 or SEQ ID NO: 4, respectively. 

* 

25. The method according to any of claims 20 - 24, wherein the cell in step (a) is mutated 
in yusZ (SEQ ID NO: 1), yusX (SEQ ID NO: 3), and/or yusY (SEQ ID NO: 5), or homologues 
thereof. 

26. The method according to* claim 25, wherein the yusZ, yusX, and/or yusY homologues 
•encode a polypeptide having an amino acid sequence at least 70% identical to the sequence « 
shown in SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID NO: 6, respectively. 

27. The method according to claim 25, wherein the yusZ, yusX, and/or yusY homologues 
have a nucleotide sequence at least 70% identical to the sequence shown in SEQ ID NO: 1, 
SEQ ID NO: 3, or SEQ ID NO: 5, respectively. 

28. The method according to any of claims 20 - 27, wherein the cell in step (a) is mutated 
in at least one polynucleotide, where a subsequence having a size of at least 100 bp of the at 
least one polynucleotide hybridizes with a polynucleotide having the sequence shown in SEQ 
ID NO: 1, SEQ ID NO: 3, or SEQ ID NO: 5, or the respective complementary sequences, 
under medium stringency hybridization conditions. 

29. The method according to any of claims 25 - 28, wherein the cell in step (a) is mutated 
by partial or full deletion of yusZ, yusX, and/or yusY, or homologues thereof, from the 
chromosome of the cell. 

30. The method according to any of claims 25 - 28, wherein the cell in step (a) is mutated 
by introducing at least one frameshift mutation or non-sense mutation in yusZ, yusX, and/or 
yusY, or homologues thereof. 
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31. The method according to any of claims 20 - 30, wherein the cell selected in step (b) 
has at least a two-fold reduced expression-level of YusZ or YusX, or homologues thereof, 
when compared with the otherwise isogenic but non-mutated cell. 

32. The method according to any of claims 20-31, wherein the cell selected in step (b) 
has no measureable expression of YusZ or YusX, or homologues thereof, when compared 
with the otherwise isogenic but non-mutated cell. 

33. The method according to any of claims 20 - 32, wherein the at least one heterologous 
polypeptide of interest comprises an enzyme. 

34. The method according to claim 33, wherein the enzyme is a lyase, a ligase, a 
hydrolase, an oxidoreductase, a transferase, or an isomerase. 

35. The method according to any of claims 20 - 34, wherein the cell comprises one or 
more chromosomally integrated copies of a polynucleotide encoding the at least one 

heterologous polypeptide of interest. 
».•« r •»«... I- • % . . * ■* * * 

36. The method according to any of claims 20 - 35, wherein the at least one heterologous v 
polypeptide of interest is encoded by a polynucleotide which is transcribed from at least one 
heterologous promoter. 

37. The method according to claim 36, wherein the at least one promoter comprises an 
artificial promoter. 

38. The method according to claim 37, wherein the artificial promoter comprises one or 
more mRNA-stabilizing sequence, preferably derived from the cryllla promoter. 

39. A method for producing a polypeptide of interest, said method comprising the steps of. 

a) cultivating a mutated prokaryotic cell, which has a reduced expression-level of YusZ 
(SEQ ID NO: 2), YusX (SEQ ID NO: 4), or homologues thereof, and which secretes 
higher amounts of the polypeptide of interest, when compared with an otherwise 
isogenic but non-mutated cell; and 

b) isolating the polypeptide of interest. 

40. The method according to claim 39, wherein the cell is a Gram-positive cell. 
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41 . The method according to claim 40, wherein the cell is a Bacillus cell. 

42. The method according to claim 41, wherein the cell is a B.alkalophilus, 
B.amyloliquefaciens, B.brevis, B.circulans, B.clausii, B.coagulans, B.lautus, B.lentus, 
BMcheniformis, B.megaterium, B.stearothermophilus, B.subtilis, or B.thuringiensis cell. 

43. The method according to any of claims 39 - 42, wherein the YusZ or YusX 
homologues comprise an amino acid sequence at least 70% identical to the sequence shown 
in SEQ ID NO: 2 or SEQ ID NO: 4, respectively. 

44. The method according to any of claims 39 - 43, wherein the cell in step (a) is mutated 
in yusZ (SEQ ID NO: 1), yusX (SEQ ID NO: 3), and/or yusY (SEQ ID NO: 5), or homologues 
thereof. 

45. The method according to claim 44, wherein the yusZ, yusX, and/or yusY homologues 
encode a polypeptide having an amino acid sequence at least 70% identical to the sequence 
shown in SEQ ID NO: 2, SEQ ID NO: 4, or SEQ ID NO: 6, respectively. 

46. The method according to claim 44, wherein the yusZ, yusX, and/or yusY homologues 
have a nucleotide sequence at least 70% identical to the sequence shown in SEQ ID NO:..1, 
SEQ ID NO: 3, or SEQ ID NO: 5, respectively. 

47. The method according to any of claims 39 - 46, wherein the cell in step (a) is mutated 
in at least one polynucleotide, wherein a subsequence having a size of at least 100 bp of the at 
least one polynucleotide hybridizes with a polynucleotide having the sequence shown in SEQ 
ID NO: 1, SEQ ID NO: 3, or SEQ ID NO: 5, or the respective complementary sequences, 
under medium stringency hybridization conditions. 

48. The method according to any of claims 39 - 47, wherein the cell in step (a) is mutated 
by partial or full deletion of yusZ, yusX, and/or yusY, or homologues thereof, from the 
chromosome of the cell. 

49. The method according to any of claims 39 - 47, wherein the cell in step (a) is mutated 
by introducing at least one frameshift mutation or non-sense mutation in yusZ, yusX, and/or 
yusY, or homologues thereof. 
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50. The method according to any of claims 39 - 49, wherein the cell in step (a) has at 
least a two-fold reduced expression-level of YusZ or YusX, or homologues thereof, when 
compared with the otherwise isogenic but non-mutated cell. 

51. The method according to any of claims 39 - 50, wherein the cell in step (a) has no 
measureable expression of YusZ or YusX, or homologues thereof, when compared with the 
otherwise isogenic but non-mutated cell. 

52. The method according to any of claims 39-51, wherein the at least one polypeptide 
of interest comprises an enzyme. 

53. The method according to claim 52, wherein the enzyme is a lyase, a ligase, a 
hydrolase, an oxidoreductase, a transferase, or an isomerase. 

54. The method according to any of claims 39 - 53, wherein the cell comprises one or 
more chromosomally integrated copies of a polynucleotide encoding the at least one 
polypeptide of interest. 

55. Thecmethod according to any of claims 39 - 54, wherein the at least one polypeptide 
of interest is 1 encoded by a polynucleotide which is transcribed from at least one heterologous 
promoter. 

56. The method according to claim 55, wherein the at least one promoter comprises an 
artificial promoter. 

57. The method according to claim 56, wherein the artificial promoter comprises one or 
more mRNA-stabilizing sequence, preferably derived from the cry\\\a promoter. 



28 



10576.000-DK 



ABSTRACT 

A mutated prokaryotic cell, which secretes higher amounts of at least one 
heterologous polypeptide of interest and which has a reduced expression-level of YusZ or 
YusX, or homologues thereof, when compared with an otherwise isogenic but non-mutated 
cell, and methods for constructing and using such a cell in the production of polypeptides. 
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<210> 2 
<211> 280 
<212> PRT 

<213> Bacillus subtilis 168 
<400> 2 

Met Asn Lys Lys He Ala lie Val Thr Gly Ala Ser Ser Gly Phe Gly 
15 10 15 

Leu Leu Ala Ala val Lys Leu Ala Arg Ser Phe Phe val lie Ala Thr 

20 25 30 

Ser Arg Gin Pro Glu Lys Ala Glu Gin Leu Arg Glu Leu Ala Ala Ala 
35 40 45 
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SEQUENCE LISTING p\fQ 

<110> Novozymes A/S 

<120> Mutated prokaryotic cells with high secretion-levels 
<130> 10576.000 
<160> 18 

<170> Patentln version 3.2 

<210> 1 
<211> 843 
<212> DNA 

<213> Bacillus subtilis 168 

<400> 1 crt 
atgaataaaa aaatagccat cgtgacagga gcgtccagcg gcttcggtct gctggcagct 60 

gtaaagcttg cccgatcatt tttcgtgata gccacatcaa gacagcctga aaaagcggaa 120 
cagcttcgag aattggctgc agcacacaat gtgtctgatt ctattcacat tacagctctc 180 
gatgtcaccg atgaacaatc tatagtctca ttcggaaaag ctgttagtgc ttacgccccg 240 
atcgatttac tcgttaacaa cgccggaacg gcttatggag gatttatcga ggatgtgccg 300 
atggaacatt tcagacaaca atttgaaacg aatgtcttcg gtgtgatcca tgtgacaaaa 360 
accgtgctgc cttacataag aaagcatggc ggcgcaaaga ttataaacgt gagcagcatc 420 
.agcggactga caggattccc agcgctgtcg ccatatgttt cttccaagca tgcattggaa 480 
:ggtttttctg aaagcctgcg tatcgagctg cttccgttcg gtatcgaaac cgctttgatc 540 
gagccgggct catacaagac atcgatctgg tcaacgtcat tatcaaattt tatgtcggtg 
cctgctgacg attcagccta tcatcaatac tataaaaaga tcctttccta tgttcaaaaa 
aacggagaag aaagcggaga tccccaagag gttgccgacc tcatttatca attggcaaca 720 
aaacagcaca taaagaattt gcgatacccg atcggaaagg gcatcaagct caccctgctg 
ttccgatcgc tttttccttg gtctgcgtgg gagtctatcc tgaagaaaaa gctattcagc 
tga 



600 
660 



780 
840 
843 
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His Asn Val Ser Asp Ser lie His lie Thr Ala Leu Asp val Thr Asp 
50 55 60 

Glu Gin ser lie Val Ser Phe Gly Lys Ala Val Ser Ala Tyr Ala Pro 
65 70 75 80 

lie Asp Leu Leu val Asn Asn Ala Gly Thr Ala Tyr Gly Gly Phe lie 

85 90 95 

Glu Asp Val Pro wet Glu His Phe Arg Gin Gin Phe Glu Thr Asn val 

100 105 HO 

Phe Gly Val lie His Val Thr Lys Thr val Leu Pro Tyr lie Arg Lys 
115 120 125 

His Gly Gly Ala Lys lie lie Asn Val Ser ser lie Ser Gly Leu Thr 
130 135 140 

Gly Phe Pro Ala Leu Ser Pro Tyr val Ser ser Lys His Ala Leu Glu 
145 150 155 160 

Gly Phe ser Glu ser Leu Arg He Glu Leu Leu Pro Phe Gly lie Glu 

165 170 175 

Thr=Ala Leu He Glu Pro Gly. ser Tyr Lys Thr ser ile.Trpser Thr 

180 185 190 

■ 

ser Leu ser Asn Phe Met ser val Pro Ala Asp Asp ser Ala Tyr His 
195 200 205 

Gin Tyr Tyr Lys Lys lie Leu Ser Tyr Val Gin Lys Asn Gly Glu Glu 
210 215 220 

ser Gly Asp Pro Gin Glu val Ala Asp Leu He Tyr Gin Leu Ala Thr 
225 230 235 240 

Lys Gin His lie Lys Asn Leu Arg Tyr Pro He Gly Lys Gly lie Lys 
y 245 250 255 

Leu Thr Leu Leu Phe Arg Ser Leu Phe Pro Trp Ser Ala Trp Glu Ser 

260 265 270 

lie Leu Lys Lys Lys Leu Phe Ser 
275 280 

<210> 3 
<211> 1503 
<212> DNA 

<213> Bacillus subtilis 168 
<400> 3 

atgaatcggc tttccgcaga ttttcaatcg tcgctcgtta cattagatca taaacttgtg 
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gacattaatc 

tacatattaa 

ctgatcggaa 

gtcgggaaaa 

caggcggaga 

ttccgccaag 

ggattccggc 

caaattaaca 

aagaagccgt 

agctggtacg 

gccgcaaata 

gaaaaagcgt 

ggcttttgca 

agcgcctcaa 

atgctcaacg 

acgtttgcag 

aagcttgttc 

gcaagatttc 

agccgcttga 

gaatatcatc 

tacaatttcc 

gaagaaaaag 

acagtggagg 

gagaatgcca 

taa 



aagacgtgtg 
atgaaagaag 
accttgcggt 
tgacgatccc 
atatgatgga 
cttgggagag 
ttgaaaccta 
ggatgaaaaa 
tcgttcagtt 
atgtcgaggc 
tcattaccag 
ttcgggacgg 
ccagttttcc 
atgtctctac 
tcaggccgtt 
agatgatagt 
ttctggagga 
tatttgaaac 
atgagctgat 
cgcttttttg 
cttatacatt 
acacattcga 
atttggcgat 
tcaagctggc 
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gaatgaattg ttaacaaaac cgggattgcg 

acagagggtt gccgaaaagc ttagccccgg 

ggacggatat cacgcttgga gtgacttata 

gtatgaggaa aacggtgaaa acaagcaatt 

tcatcaagac cgtactgtca gaaaaacagt 

caagcaagat attttcagca gcacgctgaa 

taaagcgcgc ggctgggaga atgtcctgaa 

agaaacactt gatacgatgt ggcaggtcat 

tttgaaccgg aaagcatcca tgctcggcct 

accgatcggt tctgacggaa aggtctattc 

ccagttttca acgtttggca aaaagctgtc 

ctggattgag gcggaagaca ggagcggaaa 

ggacagcggg gaatcccgga ttttcatgac 

ccttgcgcat gaactcgggc acgcgttcca 

aaaccgttcc tacgccatga acgttgcaga 

ggcggacgcg actgtccagc aggccgagac 

taaagtgcaa- agaagcgttg cgttcttcat 

gagattctac gaggaacgaa agcggggagt 

ggaagaggcg caaagagagg catactgcaa 

ggcatcaaag cttcattttc acatcacgag 

cggctacctg ttttctcttg gtatttacgc 

agagaagtat atggcgctat tgcgcgatac 

gaagcatttg ggcgctgaca tcacaaagcg 

tgtgcgtgac gccgaaacct ttttacaaat 



cgatgtttct 120 

taaggaaaaa 180 

caatatggtt 240 

gtctgtcggt 300 

atatgaacgt 360 

tcatttggcg 420 

ggaaccgctg 480 

cactgaaaac 540 

tgaaaaactc 600 

gtatgatgaa 660 

ctcattcact 720 

aagagtcggc 780 

atrttcggga 840 

tcaggaagca 900 

aacagcttca 960 

gagggaagaa 1020 

gaacattcac 1080 

ggtaccggcc 1140 

tgcgttagaa 1200 

ggtgccgttt 1260 

gttggcgctt 1320 

ggcttctatg 1380 

cgatttctgg 1440 

gaccgaatct 1500 

1503 



<210> 4 

<211> 500 

<212> PRT 

<213> Bacillus subtil is 168 

<400> 4 

Met Asn Arg Leu ser Ala Asp Phe Gin Ser ser Leu val Thr Leu Asp 
1 5 10 15 

His Lys Leu val Asp He Asn Gin Asp val Trp Asn Glu Leu Leu Thr 

20 25 30 

Lys Pro Gly Leu Arg Asp val Ser Tyr lie Leu Asn Glu Arg Arg Gin 
35 40 45 

Page 3 



10576.000.ST25.txt 

Arg val Ala Glu Lys Leu Ser Pro Gly Lys Glu Lys Leu lie Gly Asn 
50 55 60 

Leu Ala val Asp Gly Tyr His Ala Trp Ser Asp Leu Tyr Asn Met val 
65 70 75 80 

val Gly Lys Met Thr lie Pro Tyr Glu Glu Asn Gly Glu Asn Lys Gin 

85 90 95 

Leu Ser val Gly Gin Ala Glu Asn Met Met Asp His Gin Asp Arg Thr 

100 105 HO 

val Arg Lys Thr val Tyr Glu Arg Phe Arg Gin Ala Trp Glu ser Lys 
115 ' 120 125 

Gin Asp lie Phe Ser Ser Thr Leu Asn His Leu Ala Gly Phe Arg Leu 
130 135 140 

Glu Thr Tyr Lys Ala Arg Gly Trp Glu Asn val Leu Lys Glu Pro Leu 
145 150 155 160 

Gin lie Asn Arg Met Lys Lys Glu Thr Leu Asp Thr Met Trp Gin Val 

165 170 175 

» -* • 

• lie Thr Glu Asn Lys Lys Pro Phe Val Gin Phe Leu Asn Arg Lys Ala 

180 185 190 

Ser Met Leu Gly Leu Glu Lys Leu ser Trp Tyr Asp val Glu Ala Pro 
195 200 205 

lie Gly Ser Asp Gly Lys Val Tyr Ser Tyr Asp Glu Ala Ala Asn lie 
210 215 220 

lie Thr Ser Gin Phe ser Thr Phe Gly Lys Lys Leu Ser Ser Phe Thr 
225 230 235 240 

Glu Lys Ala Phe Arg Asp Gly Trp He Glu Ala Glu Asp Arg Ser Gly 

245 250 255 

Lys Arq val Gly Gly Phe Cys Thr Ser Phe Pro Asp Ser Gly Glu Ser 
7 y 260 265 270 

Arq He Phe Met Thr Phe Ser Gly Ser Ala ser Asn val ser Thr Leu 
275 280 285 

Ala His Glu Leu Gly His Ala Phe His Gin Glu Ala Met Leu Asn Val 
290 295 300 

Arq Pro Leu Asn Arg ser Tyr Ala Met Asn val Ala Glu Thr Ala Ser 
305 310 315 320 
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Thr Phe Ala Glu Met lie val Ala Asp Ala Thr val Gin Gin Ala Glu 

325 330 335 

Thr Arg Glu Glu lvs Leu val Leu Leu Glu Asp Lys val Gin Arg ser 

340 345 350 

Val Ala Phe Phe Met Asn lie His Ala Arg Phe Leu Phe Glu Thr Arg 
355 360 365 

Phe Tyr Glu Glu Arg Lys Arg Gly val val pro Ala ser Arg Leu Asn 
370 375 380 

Glu Leu Met Glu Glu Ala Gin Arg Glu Ala Tyr cys Asn Ala Leu Glu 
385 390 395 400 

Glu Tyr His Pro Leu Phe Trp Ala Ser Lys Leu His Phe His lie Thr 

405 410 415 

Arg val Pro Phe Tyr Asn Phe Pro Tyr Thr Phe Gly Tyr Leu Phe Ser 
* 420 425 430 

Leu Gly lie Tyr Ala Leu Ala Leu Glu Glu Lys Asp Thr Phe Glu Glu 
435 440 445 

Lys Tyr Met Ala Leu Leu Arg Asp Thr A»la ser Met Thr val Glu Asp 
450 455 . 460 

Leu Ala Met Lys His Leu Gly Ala Asp lie Thr Lys Arg Asp Phe Trp 
465 470 475 480 

Glu Asn Ala He Lys Leu Ala val Arg Asp Ala Glu Thr Phe Leu Gin 

485 490 495 

Met Thr Glu Ser 

500 

<210> 5 
<211> 297 
<212> DNA 

<213> Bacillus subtilis 168 

<400> 5 _ ari 

atggcgctgc aaggggtagg ccaaagatgg gatcttgatt cattttttaa gggcgggagc t>u 

caatcggaag aattcaaagg ttatattgag aagctgtcac aaagtctgcg tgcatttcaa 120 

gacaggactg atgcgtttca ggtgcctgaa tcgcctgaag aggctgaagg gctgaccgcg 180 

ttgcttgatt taatcgaaca gacatccgtc aagcttcagc aggccggcgc ttttgttgct 240 

tgcctccagg cccagaatat caatgatcaa aaggctatag agcatcaggc ctcatga 297 

<210> 6 
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<211> 98 
<212> PRT 

<213> Bacillus sutrtilis 168 
<400> 6 

Met Ala Leu Gin Gly val Gly Gin Arg Trp Asp Leu Asp Ser Phe Phe 
15 10 15 

Lys Gly Gly ser Gin ser Glu Glu Phe Lys Gly Tyr lie Glu Lys Leu 

20 25 30 

Ser Gin ser Leu Arg Ala Phe Gin Asp Arg Thr Asp Ala Phe Gin val 
35 40 45 

Pro Glu ser Pro Glu Glu Ala Glu Gly Leu Thr Ala Leu Leu Asp Leu 
50 55 60 

lie Glu Gin Thr Ser val Lys Leu Gin Gin Ala Gly Ala Phe val Ala 
65 70 75 80 

Cys Leu Gin Ala Gin Asn lie Asn Asp Gin Lys Ala lie Glu His Gin 

85 90 95 

Ala ser 



<210> 7 v i 

<211> 25 

<212> ONA 

<213> artificial sequence 
<220> 

<223> Primer yusZlF 

<400> 7 

ccttcccggg gctaagcttt tcggc 



<210> 8 

<211> 30 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Primer yusz2R 

<400> 8 

gatagactcc cacgcgctgg acgctcctgt 

<210> 9 
<211> 30 
<212> DNA 

<213> artificial sequence 

<220> 

<223> Primer yusZ2F 
<400> 9 

acaggagcgt ccagcgcgtg ggagtctatc 
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<210> 10 

<211> 25 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Primer yusZ3R 

<400> 10 

aacggtaccc tgaccaagca gacag 



<210> 11 

<211> 25 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Primer yusXIF 

<400> 11 

aatgcccggg caagctttac agctg 



<210> 12 

<211> 30 

<212> DMA 

<213> artificial sequence 
<220> 

<223> Primer yusx2R 

<400> 12 

ggcgtcacgc acagcaacga gcgacgattg 



<210> 13 

<211> 30 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Primer yusx2F 

<400> 13 

caatcgtcgc tcgtt'gctgt gcgtgacgcc 



<210> 14 

<211> 25 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Primer yusx3R 

<400> 14 

aatcggtacc atcataatga ctgtc 



<210> 15 
<211> 1315 
<212> DNA 

<213> artificial sequence 
<220> 
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<223> PCR product yuszSOEpcr 
<400> 15 

ccttcccggg gctaagcttt tcggcaaccc tctgtcttct ttcatttaat atgtaagaaa 
catcgcgcaa tcccggtttt gttaacaatt cattccacac gtcttgatta atgtccacaa 
gtttatgatc taatgtaacg agcgacgatt gaaaatctgc ggaaagccga ttcatgaggc 180 
ctgatgctct atagcctttt gatcattgat attctgggcc tggaggcaag caacaaaagc 240 
gccggcctgc tgaagcttga cggatgtctg ttcgattaaa tcaagcaacg cggtcagccc 
ttcagcctct tcaggcgatt caggcacctg aaacgcatca gtcctgtctt gaaatgcacg 
• cagactttgt gacagcttct caatataacc tttgaattct tccgattggc tcccgccctt 420 

480 
540 
600 



300 
360 



840 
900 



aaaaaatgaa tcaagatccc atctttggcc taccccttgc agcgccatat cacaaaaccc 
ctttccttct ttcgttctat tatagaacaa ttctgaatta ttgtataaaa atttctatta 
caggcgtcat ttcgtgttca ggatagtaca atagctgtag cagtttaata ggagggttag 

gatgaataaa aaaatagcca tcgtgacagg agcgtccagc gcgtgggagt ctatcctgaa 660 

gaaaaagcta ttcagctgat ctaaattata attattataa tttagtattg atttttattt 720 

agtatatgat ataattaagt caacagatca caaggaggac gttatcttat gaaaactgaa 780 
aacgcaaaaa caaatcaaac attagttgag aattcactga acacacaatt atcaaactgg 
tttcttttat actctaagct ccaccgtttc cattggtatg tgaaagggcc tcatttcttt 

acattgcacg agaaatttga agaactttat gaccatgcgg ctgaaacagt ggataccatc 960 

gctgagcgcc tgctggcgat tggcggacag cctgttgcca cagtgaaaga atacactgag 1020 

catgcatcta tcacagacgg cggaaacgaa acatcagcat cagaaatggt acaagcattg 1080 

gtaaacgact acaaacaaat cagcagcgaa tctaaattcg tgatcggcct ggctgaagaa 1140 

aatcaagaca atgcgacagc ggacttgttt gtcggattaa ttgaagaagt tgaaaaacaa 1200 

gtgtggatgc tttcctctta tttagggtaa caaaaaagct gaaccttaat cgggttcagc 1260 

tttttgtttt ttcttagctt gaactgctrt ctgtctgctt ggtcagggta ccgtt 1315 

<210> 16 

<211> 6305 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Plasmid pAN28 

<400> 16 an 
gctaagcttt tcggcaaccc tctgtcttct ttcatttaat atgtaagaaa catcgcgcaa ou 

tcccggtttt gttaacaatt cattccacac gtcttgatta atgtccacaa gtttatgatc 120 

taatgtaacg agcgacgatt gaaaatctgc ggaaagccga ttcatgaggc ctgatgctct 180 

atagcctttt gatcattgat attctgggcc tggaggcaag caacaaaagc gccggcctgc 240 

tgaagcttga cggatgtctg ttcgattaaa tcaagcaacg cggtcagccc ttcagcctct 300 

tcaggcgatt caggcacctg aaacgcatca gtcctgtctt gaaatgcacg cagactttgt 360 
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mmm ^ft. a«™ mm mm mm aftm asa am, ^km 

gacagcxxcx 


mmm mm. mm. mm. A* ma. mm • m 

caa tataacc 


m*m* mm fm mm mm mm a» f~ 

tt rgaa ltct: 


mmm mm ^mm mmm mmm 4* « mT* 

xccgarrggc 


m* f— f £— ft jm f i~mm m- 

xcccgcccx t 


fi "A fi i *i n **% *ai n 

aaaaaaxgaa 


/On 


tcaagaxccc 


*m mmm mm mfm Mmm. m\m aa mm a** -mm 

axcxxxggcc 


4*m mm -mm mmm mmm mm m+m mt*m JTm mmm 

tacccc TTgc 


mm Mmm. mm mmm mmm mmm **m 4 » * "4 

agcgccaxax 


cacaaaaccc 


cxxxccxxcx 


'tou 


ttcgtxctat 


mmm mm. aftm mm. mm mmm mm mm mm. mm 

tatagaacaa 


"mm*4%m mVm m-m mmx mm mm. m\mm -h— 

txctgaaxta 


xtgtaxaaaa 


mm mMm m% m m^ m ^mm +m mm A, « 

axxxcxaxxa 

• 


mm mm mm Mmm m** jmm ^ «-4 

caggegxeax 




4* a**, a** mtm mHt mmm 4mm- mmm mm 

xxcgxgxtca 


mm. Mm mm. mf** mm mm. mm* mm. mmm mm 

ggatagtaca 


mm. mfm mmm -mm _#> m\m mm 4%m mm —mm 

axagctgtag 


mm mm. mmm m%m mm* mmm mm mm. «W m 

cagttxaata 


Mm mm mm mmm mm mm A* m*m mm Mm 

ggagggxxag 


gaxgaa Laaa 




*~i *™\ am mhm mm. mm mm mm mm 

aaaatagcca 


■** mWm «4» *~™ J-\ -mm mm mmm mm. 

tcgtgacagg 


mmm. mfm -mm 4%m mm* _mm mm mmm mm 

agcgtccagc 


«*> *^ ^ mmm mmm mmm Mm mm. -Arm 

gcgtgggagt 


exaxcexgaa 


gaaaaagcxa 


CCA 

ODU 


m\m m\mr mmm m*\ mmm AW* mtm mm, mm aW 

ttcagcxgat 


ctaaattata 


*■ V "mm «*— mmm m^m *Amm mmm mmm ~~m. —m 


mmm, mmm AXm mm. -mm mmm, mm m*m mtm mm 

txtagxaxtg 


axxxxxaxxx 


— * mm -frnm mm. i^W «a aW a. am aW 

agxaxaxgax 


79n 

/ mU 


r-% »*a m» aft mfm mtm mm, am Mm A* 

ataattaagt 


mm mm mm. mm> mm mmm mm. mmm mmm mm 

caacagatca 


mm mm mm mmm mmm. mm. mmm mmm mm mm 

caaggaggac 


-mm mmm m%m Mm «W Mm -Aw- mfm mm 4\m 

gttatcttax 


gaaaacxgaa 


mm — _a_ a— aa> mmm mmm mm mmm mmm 

aacgeaaaaa 




aft mm. mm mm. #am mm mm m. m& ami 

caaatcaaac 


ft . m a ft _ r mm mmm 

attagttgag 


aattcactga 


acacacaatt 


# a*> a^ «k a. ^aa, A _■ _ ■- _ 

atcaaactgg 


a> a. a. m^m* a. a. a. a> 

xxxcxtxxax 




actctaagct 


ccaccgtttc 


i — mm t ■ * „ J _ m 

cattggtatg 


tgaaagggcc 


*a« a-* — aW aW 4W a* aV. aka aW 

tcatttctxt 


acattgeacg 


yuu 


agaaatttga 


agaactttat 


gaccatgcgg 


cxgaaacagx 


ggataccatc 


gctgagcgcc 




tgctggcgat 


tggcggacag 


cctgttgcca 


cagtgaaaga 


atacactgag 


mm mm Am a*a, a*» «*a Aa a** «■> 

caxgeaxcta 


i mn 


tcacagacgg 


cggaaacgaa 


acatcagcat 


cagaaatggt 


acaagcattg 


as #k fa aftk flft a« mm mm mmm 

gtaaacgacx 


1 Afirt 

XUoU 


acaaacaaat 


cagcagcgaa 


tctraaattcg 


tgatcggcct 


ggctgaagaa 


mm. mmm. mmm ami f* am afea am am aft. 

aatcaagaca 


1 1 /| A 


atgcgacagc 


ggacttgttt 


gtcggattaa 


xxgaagaagt 


xgaaaaacaa 


aft> ^ftm asm ml am aftl am aaa am ai 

gtgtggaxgc 


12 ou 


tttcctctta 


tttagggtaa 


caaaaaagct 


gaaccttaat 


egggttcage 


■ -a- a - - -a- a _ A . 

tttttgtttt 


1Z60 


ttcttagctt 


gaactgcttt 


ctgtctgctt 


ggtcagggta 


ccattttgac 


ggaaccgatt 




gggacgagtc 


ccgaaagctg 


aaccgcatct 


ataagtttca 


aggaaaggct 


mm a. a> a> aft. aa a#m am am Mm 

tgggattggg 




aagtttccaa 


tgaaaacggc 


aactatgatt 


atttgatgta 


tgccgacatc 


gattatgacc 


1440 


atcctgatgt 


cgcagcagaa 


attaagagat 


ggggcacttg 


gtatgccaat 


gaactgcaat 


i cr»n 


tggacggaaa 


ccgtcttgat 


gctgtcaaac 


— — mm * ■ — mmm mmm Mmm 

acattaaatt 


aft* A- r~ aW aV> aV> a. aW ■ 

xtcttttxtg 


cgggattggg 


*l CCA 


«M *W mm pft mm* .mm mm. mtm mm mim 

ttaatcatgt 


mm mm. mm j* _ ■ am. aft. __ 

cagggaaaaa 


acggggaagg 


mm mm a «W mm «W «W «W M«t mm 

aaatgttxac 


«4 aWa, aaa w% a«a aft ai «k am 

ggtagctgaa 


aW mB aW al aftl am am aa am 

tattggcaga 


icon 


m\m mmm mm. mm* m\m mtm mmm m*m Mmm 

atgacttggg 


mm -mm mmm mmm mmm mm am mmm mm mm 

cgcgctggaa 


mmm mm. ^mm mmm Mm mmm 4mm 4W Mm, Mm. 

aactatttga 


acaaaacaaa 


xxxxaatcax 


mtm a**> — * ^a -A— aftaaW afa> aW aft* 

xcagtgxxxg 


icon 
IDOU 


■m mm> aft* mmm mmm a«a -mm am aW 

acgtgccgct 


tcaxtatcag 


ttccatgctg 


catcgacaca 


gggaggegge 


mimm am aW aftj m mkm aa aW am aft 

xatgaxaxga 




mmm mm ma mm mm. mfm mfm* a"* *■» mtm 

ggaaattgct 


mm mm mm mmm mmm mmm mtm mm mm mmm 

gaagggtacg 


a« mm> mm 4* A* Amm mm, .mm mm 

gtcgtttcca 


mm mmm mm mm JLm mm mm, mm mfm «W 1 

agcaxccgxx 


ffta mm mm. mm mAm mm mm. mmm m\m 4- 

gaaaxcggxx 


^a> «~ m-m mtm Am a> am -mv 

acaxxxgxcg 


loUU 


"1 "1 mm, a— i mm mtm» mm* mm 

axaaccaxga 


xacacagccg 


mmm mmm -mmm mmm mmm. mm 

gggcaatcgc 


+m m%m M-m Mmm «W- M-m mmm mm_ mmm. 

xxgagxcgac 


m\m mmm «W ^aa ^4 — . — — mm mm 

xgxecaaaca 


a>a m m -Jka afa> m aa. mmm mmm 

tggxxxaagc 


1 CCA 


CyCiLycxid 


rnrtttfai-t 


M** 4\m mmm mmm _mm. mm mm. mmm 

c ncacaaggg 


aaxcxggaxa 


cccxcaggxx 


xxcxacgggg 


i Q7n 


•a ■♦*■ O n ♦* a ffyfx 

aiatgLdLgy 


/> m "u n i~i 

ycH_yaddyga 


41-% -f* Mm ^ mm mmm m-m 

gacxcccagc 


gCgacLd L LLC 


■f f% t~ ^ ■f m- rt ^ 'm m 

tgccxxgaaa 


cacaaaaxxg 


1QOA 


aaccga lcx l 


aaaagcgaga 


aaacagtarg 


eg td egg age 


acagcaxgax 


taxxxegace 




accatgacat 


tgtcggctgg 


acaagggaag 


gcgacagctc 


ggttgcaaat 


tcaggtxtgg 


2100 


cggcattaat 


aacagacgga 


cccggtgggg 


caaagegaat 


gtatgtcggc 


eggcaaaacg 


2160 


ccggtgagac 


atggcatgac 


attaccggaa 


accgttcgga 


gccggttgtc 


ateaattegg 


2220 


aaggctgggg 


agagtttcac 


gtaaacggcg 


ggtcggtttc 


aatttatgtx 


caaagataga 


2280 


agagcagaga 


ggacggattt 


cctgaaggaa 


atccgttttt 


ttattttgee 


cgtcttataa 


2340 


atttcgttga 


ttacatttta 


taattaattt 


taacaaagtg 
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2400 
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tgctgacagt ttagaatccc taggtaaggc ggggatgaaa tggcaacgtt atctgatgta 

gcaaagaaag aaatgtgtcg aaaatgacgg tatcgcgggt gatcaatcat cctgagactg 

tgacggatga attgaaaaag cttgcatgcc tgcaggtcga ttcacaaaaa ataggcacac 

gaaaaacaag ttaagggatg cagtttatgc atcccttaac ttacttatta aataatttat 

agctattgaa aagagataag aattgttcaa agctaatatt gtttaaatcg tcaattcctg 

catgttttaa ggaattgtta aattgatttt ttgtaaatat tttcttgtat tctttgttaa 

cccatttcat aacgaaataa ttatactttt gtttatcttt gtgtgatatt cttgattttt 

ttctacttaa tctgataagt gagctattca ctttaggttt aggatgaaaa tattctcttg 

gaaccatact taatatagaa atatcaactt ctgccattaa aagtaatgcc aatgagcgtt 

ttgtatttaa taatctttta gcaaacccgt attccacgat taaataaatc tcattagcta 

tactatcaaa aacaattttg cgtattatat ccgtacttat gttataaggt atattaccat 

atattttata ggattggttt ttaggaaatt taaactgcaa tatatccttg tttaaaactt 

ggaaattatc gtgatcaaca agtttatttt ctgtagtttt gcataattta tggtctattt 

caatggcagt tacgaaatta cacctcttta ctaattcaag ggtaaaatgg ccttttcctg 

agccgatttc aaagatatta tcatgttcat ttaatcttat atttgtcatt attttatcta 

tattatgttt tgaagtaata aagttttgac tgtgttttat atttttctcg ttcattataa 

ccctctttaa tttggttata tgaattttgc ttattaacga ttcattataa ccacttattt 

tttgtttggt tgataatgaa ctgtgctgat tacaaaaata ctaaaaatgc ccatattttt 

tcctccttat aaaattagta taattatagc acgagctctg ataaatatga acatgatgag 

tgatcgttaa atttatactg caatcggatg cgattattga ataaaagata tgagagattt 

atctaatttc ttttttcttg taaaaaaaga aagttcttaa aggttttata gttttggtcg 

tagagcacac ggtttaacga cttaattacg aagtaaataa gtctagtgtg ttagacttta 

tgaaatctat atacgtttat atatatttat tatccggagg tgtagcatgt ctcattcaat 

tttgagggtt gccagagtta aaggatcaag taatacaaac gggatacaaa gacataatca 

aagagagaat aaaaactata ataataaaga cataaatcat gaggaaacat ataaaaatta 

tgatttgatt aacgcacaaa atataaagta taaagataaa attgatgaaa cgattgatga 

gaattattca gggaaacgta aaattcggtc agatgcaatt cgacatgtgg acggactggt 

tacaagtgat aaagatttct ttgatgattt aagcggagaa gaaatagaac gattttttaa 

agatagcttg gagtttctag aaaatgaata cggtaaggaa aatatgctgt atgcgactgt 

ccatctggat gaaagagtcc cacatatgca ctttggtttt gtccctttaa cagaggacgg 

gagattgtct gcaaaagaac agttaggcaa caagaaagac tttactcaat tacaagatag 

atttaatgag tatgtgaatg agaaaggtta tgaacttgaa agaggcacgt ccaaagaggt 

tacagaacga gaacataaag cgatggatca gtacaagaaa gatactgtat ttcataaaca 

ggaactgcaa gaagttaagg atgagttaca gaaggcaaat aagcagttac agagtggaat 
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2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 



caccagcgaa aactggttta agccgactgc gcaaaagaca taatcgactc tagaggatcc 
ccggg 



4800 
4860 
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agagcatatg aggtctacga aaccctttga ttatgaaaat gagcgtacag gtttgttctc 4500 

tggacgtgaa gagactggta gaaagatatt aactgctgat gaatttgaac gcctgcaaga 4560 

aacaatctct tctgcagaac ggattgttga tgattacgaa aatattaaga gcacagacta 4620 

ttacacagaa aatcaagaat taaaaaaacg tagagagagt ttgaaagaag tagtgaatac 4680 

atggaaagag gggtatcacg aaaaaagtaa agaggttaat aaattaaagc gagagaatga 4740 
tagtttgaat gagcagttga atgtatcaga gaaatttcaa gctagtacag tgactttata 
tcgtgctgcg agggcgaatt tccctgggtt tgagaaaggg tttaataggc ttaaagagaa 

attctttaat gattccaaat ttgagcgtgt gggacagttt atggatgttg tacaggataa 4920 

tgtccagaag gtcgatagaa agcgtgagaa acagcgtaca gacgatttag agatgtagag 4980 

gtacttttat gccgagaaaa ctttttgcgt gtgacagtcc ttaaaatata cttagagcgt 5040 

aagcgaaagt agtagcgaca gctattaact ttcggtttca aagctctagg atttttaatg 5100 

gacgcagcgc atcacacgca aaaaggaaat tggaataaat gcgaaatttg agatgttaat 5160 

taaagacctt tttgaggtct ttttttctta gatttttggg gttatttagg ggagaaaaca 5220 

taggggggta ctacgacctc ccccctaggt gtccattgtc cattgtccaa acaaataaat 5280 

aaatattggg tttttaatgt taaaaggttg ttttttatgt taaagtgaaa aaaacagatg 5340 

ttgggaggta cagtgatggt tgtagataga aaagaagaga aaaaagttgc tgttacttta 5400 

agacttacaa cagaagaaaa tgagatatta aatagaatca aagaaaaata taatattagc 5460 

aaatcagatg caaccggtat tctaataaaa aaatatgcaa aggaggaata cggtgcattt 5520 

taaacaaaaa aagatagaca gcactggcat gctgcctatc tatgactaaa ttttgttaag 5580 

tgtattagca ccgttattat atcatgagcg aaaatgtaat aaaagaaact gaaaacaaga 5640 

aaaattcaag aggacgtaat tggacatttg ttttatatcc agaatcagca aaagccgagt 5700 

ggttagagta tttaaaagag ttacacattc aatttgtagt gtctccatta catgataggg 5760 

atactgatac agaaggtagg atgaaaaaag agcattatca tattctagtg atgtatgagg 5820 

gtaataaatc ttatgaacag ataaaaataa ttacagaaga attgaatgcg actarcccgc 5880 

agattgcagg aagtgtgaaa ggtcrtgtga gatatatgct tcacatggac gatcctaata 5940 

6000 

6060 



aatxtaaata tcaaaaagaa gatatgatag tttatggcgg tgtagatgtt gatgaattat 
taaagaaaac aacaacagat agatataaat taattaaaga aatgattgag tttattgatg 

# » 

aacaaggaat cgtagaattt aagagtttaa tggattatgc aatgaagttt aaatttgatg 6120 

attggttccc gcttttatgt gataactcgg cgtatgttat tcaagaatat ataaaatcaa 6180 

atcggtataa atctgaccga tagattttga atttaggtgt cacaagacac tcttttttcg 6240 



6300 
6305 



<210> 17 
<211> 1090 
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<212> DNA 

<213> artificial sequence 



<220> 

<223> PCR 

<400> 17 
aatgcccggg 

atggctattt 

ctgaacacga 

aatagaacga 

tgggatcttg 

gagaagctgt 

gaatcgcctg 

gtcaagcttc 

caaaaggcta 

tcgttgctgt 

gtggcgttaa 

tatacctaaa 

ggggctgtat 

caggcagaag 
tcgcttcgtg 
aacgggactc 
gaagaagcgt 
gctgaaacag 
ggtaccgatt 



product yusxSOEpcr 

caagctttac agctgccagc 
ttttattcat cctaaccctc 



aatgacgcct 

aagaaggaaa 

attcattttt 

cacaaagtct 

aagaggctga 

agcaggccgg 

tagagcatca 

gcgtgacgcc 

atgccccggc 

ataaatggaa 

gtctcgcagt 

gagaaaatac 

tgagctacgg 

aggaaaaagc 

tagatgagat 

aagtgataca 



gtaatagaaa 

ggggttttgt 

taagggcggg 

gcgtgcattt 

agggctgacc 

cgcttttgtt 

ggcctcatga 

gaaacctttt 

tttttcaatt 

tcctacaggg 

ggttctgatt 

agccgtcacc 

ggatggtgag 

ggaaatacag 

gaaaatgatt 

cagtgtgctg 



agaccgaagc 

ctattaaact 

tttttataca 

gatatggcgc 

agccaatcgg 

caagacagga 

gcgttgcttg 

gcttgcctcc 

atcggctttc 

tacaaatgac 

cttctctgaa 

gggaaacata 

gcgggctgcc 

cagctgaagt 

cataaccgtt 

gataagctct 

ttgagcgagc 

gcagcattta 



cgctggacgc 

gctacagcta 

ataattcaga 

tgcaaggggt 

aagaattcaa 

ctgatgcgtt 

atttaatcga 

aggcccagaa 

cgcagatttt 

cgaatcttaa 

tgaggatttc 

tgcatttgat 

gtttcaatga 

ccgttcccta 

atgaaggaat 

ccggtgtcaa 

tttccgtgac 

atctggacag 



tcctgtcacg 

ttgtactatc 

attgttctat 

aggccaaaga 

aggttatatt 

tcaggtgcct 

acagacatcc 

tatcaatgat 

caatcgtcgc 

agaaaaagcc 

attctctgga 

cagagcagcc 

agaccagcat 

cagtaatttt 

ctatacaaag 

tcaggaagga 

agaccaaatg 

tcattatgat 



<210> 18 

<211> 6079 

<212> ONA 

<213> artificial sequence 



<220> 

<223> Plasmid pAN23 
<400> 18 

caagctttac agctgccagc 

ttttattcat cctaaccctc 

aatgacgcct gtaatagaaa 

aagaaggaaa ggggttttgt 

attcattttt taagggcggg 

cacaaagtct gcgtgcattt 

aagaggctga agggctgacc 



agaccgaagc 
ctattaaact 
tttttataca 
gatatggcgc 
agccaatcgg 
caagacagga 
gcgttgcttg 



cgctggacgc tcctgtcacg 

gctacagcta ttgtactatc 

ataattcaga attgttctat 

tgcaaggggt aggccaaaga 

aagaattcaa aggttatatt 

ctgatgcgtt tcaggtgcct 

atttaatcga acagacatcc 
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atggctattt 
ctgaacacga 
aatagaacga 
tgggatcttg 
gagaagctgt 
gaatcgcctg 
gtcaagcttc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1090 



60 
120 
180 
240 
300 
360 
420 
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agcaggccgg cgcttttgtt gcttgcctcc aggcccagaa tatcaatgat caaaaggcta 480 

tagagcatca ggcctcatga atcggctttc cgcagatttt caatcgtcgc tcgttgctgt 540 

gcgtgacgcc gaaacctttt tacaaatgac cgaatcttaa agaaaaagcc gtggcgttaa 600 

atgccccggc tttttcaatt cttctctgaa tgaggatttc attctctgga tatacctaaa 660 

ataaatggaa tcctacaggg gggaaacata tgcatttgat cagagcagcc ggggctgtat 720 

gtctcgcagt ggttctgatt gcgggctgcc gtttcaatga agaccagcat caggcagaag 780 

gagaaaatac agccgtcacc cagctgaagt ccgttcccta cagtaatttt tcgcttcgtg 840 

tgagctacgg ggatggtgag cataaccgtt atgaaggaat ctatacaaag aacgggactc 900 

aggaaaaagc ggaaatacag gataagctct ccggtgtcaa tcaggaagga gaagaagcgt 960 

tagatgagat gaaaatgatt ttgagcgagc tttccgtgac agaccaaatg gctgaaacag 1020 

aagtgataca cagtgtgctg gcagcattta atctggacag tcattatgat ggtaccattt 1080 

tgacggaacc gattgggacg agtcccgaaa gctgaaccgc atctataagt ttcaaggaaa 1140 

ggcttgggat tgggaagttt ccaatgaaaa cggcaactat gattatttga tgtatgccga 1200 

catcgattat gaccatcctg atgtcgcagc agaaattaag agatggggca cttggtatgc 1260 

caatgaactg caattggacg gaaaccgtct tgatgctgtc aaacacatta aattttcttt 1320 

tttgcgggat tgggttaatc atgtcaggga aaaaacgggg aaggaaatgt ttacggtagc 1380 

tgaatattgg cagaatgact tgggcgcgct ggaaaactat ttgaacaaaa caaattttaa 1440 

tcattcagtg tttgacgtgc cgcttcatta tcagttccat gctgcatcga cacagggagg 1500 

cggctatgat atgaggaaat tgctgaaggg tacggtcgtt tccaagcatc cgttgaaatc 1560 

ggttacattt gtcgataacc atgatacaca gccggggcaa tcgcttgagt cgactgtcca 1620 

aacatggttt aagccgcttg cttacgcttt tattctcaca agggaatctg gataccctca 1680 

ggttttctac ggggatatgt acgggacgaa aggagactcc cagcgcgaaa ttcctgcctt 1740 

gaaacacaaa attgaaccga tcttaaaagc gagaaaacag tatgcgtacg gagcacagca 1800 

tgattatttc gaccaccatg acattgtcgg ctggacaagg gaaggcgaca gctcggttgc 1860 

aaattcaggt ttggcggcat taataacaga cggacccggt ggggcaaagc gaatgtatgt 1920 

cggccggcaa aacgccggtg agacatggca tgacattacc ggaaaccgtt cggagccggt 1980 

tgtcatcaat tcggaaggct ggggagagtt tcacgtaaac ggcgggtcgg tttcaattta 2040 

tgttcaaaga tagaagagca gagaggacgg atttcctgaa ggaaatccgt ttttttattt 2100 

tgcccgtctt ataaatttcg ttgattacat tttataatta attttaacaa agtgtcataa 2160 

gccctcagga atattgctga cagtttagaa tccctaggta aggcggggat gaaatggcaa 2220 

cgttatctga tgtagcaaag aaagaaatgt gtcgaaaatg acggtatcgc gggtgatcaa 2280 

tcatcctgag actgtgacgg atgaattgaa aaagcttgca tgcctgcagg tcgattcaca 2340 

aaaaataggc acacgaaaaa caagttaagg gatgcagttt atgcatccct taacttactt 2400 

attaaataat ttatagctat tgaaaagaga taagaattgt tcaaagctaa tattgtttaa 2460 
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i atcgtcaatt cctgcatgtt 

* 

j gtattctttg ttaacccatt 

; tattcttgat ttttttctac 

aaaatattct cttggaacca 
tgccaatgag cgttttgtat 
aatctcatta gctatactat 

j aggtatatta ccatatattt 

I 

: cttgtttaaa acttggaaat 

j tttatggtct atttcaatgg 

atggcctttt cctgagccga 
cattatttta tctatattat 
ctcgttcatt ataaccctct 
ataaccactt attttttgtt 
atgcccatat tttttcctcc 
atgaacatga tgagtgatcg 
gatatgagag atttatctaa 
tatagttttg gtcgtagagc 
. tgtgttagac tttatgaaat 
atgtctcatt caattttgag 
caaagacata atcaaagaga 
acatataaaa attatgattt 
gaaacgattg atgagaatta 
gtggacggac tggttacaag 
gaacgatttt ttaaagatag 
ctgtatgcga ctgtccatct 
ttaacagagg acgggagatt 
caattacaag atagatttaa 
acgtccaaag aggttacaga 
gtatttcata aacaggaact 
ttacagagtg gaatagagca 
acaggtttgt tctctggacg 
gaacgcctgc aagaaacaat 
aagagcacag actattacac 
gaagtagtga atacatggaa 
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ttaaggaatt 


gttaaattga 


ttttttgtaa 


tcataacgaa 


ataattatac 


ttttgtttat 


ttaatctgat 


aagtgagcta 


ttcactttag 


tacttaatat 


agaaatatca 


acttctgcca 


ttaataatct 


tttagcaaac 


ccgtattcca 


caaaaacaat 


tttgcgtatt 


atatccgtac 


tataggattg 


gtttttagga 


aatttaaact 


tatcgtgatc 


aacaagttta 


ttttctgtag 


cagttacgaa 


attacacctc 


tttactaatt 


tttcaaagat 


attatcatgt 


tcatttaatc 


gttttgaagt 


aataaagttt 


tgactgtgtt 


ttaatttggt 


tatatgaatt 


ttgcttatta 


tggttgataa 


tgaactgtgc 


tgattacaaa 


ttataaaatt 


agtataatta 


tagcacgagc 


ttaaatttat 


actgcaatcg 


gatgcgatta 


tttctttttt 


cttgtaaaaa 


aagaaagttc 


acacggttta 


acgacttaat 


tacgaagtaa 


ctatatacgt 


ttatatatat 


ttattatccg 


ggttgccaga 


gttaaaggat 


caagtaatac 


gaataaaaac 


tataataata 


aagacataaa 


gattaacgca 


caaaatataa 


agtataaaga 


ttcagggaaa 


cgtaaaattc 


ggtcagatgc 


tgataaagat 


ttctttgatg 


atttaagcgg 


cttggagttt 


ctagaaaatg 


aatacggtaa 


ggatgaaaga 


gtcccacata 


tgcactttgg 


gtctgcaaaa 


gaacagttag 


gcaacaagaa 


tgagtatgtg 


aatgagaaag 


gttatgaact 


acgagaacat 


aaagcgatgg 


atcagtacaa 


gcaagaagtt 


aaggatgagt 


tacagaaggc 


tatgaggtct 


acgaaaccct 


ttgattatga 


tgaagagact 


ggtagaaaga 


tattaactgc 


ctcttctgca 


gaacggattg 


ttgatgatta 


agaaaatcaa 


gaattaaaaa 


aacgtagaga 


agaggggtat 


cacgaaaaaa 


gtaaagaggt 




Page 14 



t 



» 



atatuTtCTl 




ctt xgxgtga 


•> con 


gtttaggaxg 




t xaaaag taa 


t/vU 


cgattaaata 




ttatgttata 


2820 


gcaatatatc 


2880 


ttttgcataa 


2940 


caagggtaaa 


3000 


ttatatttgt 


3060 


ttatattttt 


3120 


acgattcatt 


3180 


aatactaaaa 


3240 


tctgataaat 


3300 


ttgaataaaa 


3360 


ttaaaggctt 


3420 


ataagtctag 


3480 


gaggtgtagc 


3340 


aaacgggata 


3600 


tcatgaggaa 


3660 


taaaattgat 


3720 


aattcgacat 


3/oU 


agaagaaata 


3840 


ggaaaatatg 


3 you 


iA« ^— 

TtttigiicccL 




V« M M <^> ^ ^ ♦» 

agactttdCi 




xgaaagaggc 


4UOU 


gaaagatact 




aaataagcag 


4200 


aaatgagcgt 


4260 


tgatgaattt 


4320 


cgaaaatart 


4380 


gagtttgaaa 


4440 


taataaatta 


4500 



10576.000.ST25.txt 



aagcgagaga 
acagtgactt 
aggcttaaag 
gttgtacagg 
ttagagatgt 
tatacttaga 
taggattttt 
tttgagatgt 
taggggagaa 
ccaaacaaat 
gaaaaaaaca 
ttgctgttac 
aatataatat 
aatacggtgc 
taaattttgt 
aactgaaaac 
. agcaaaagcc 
. attacatgat 
agtgatgtat 
tgcgactatt 
ggacgatcct 



atgatagttt 

tatatcgtgc 

agaaattctt 

ataatgtcca 

agaggtactt 

gcgtaagcga 

aatggacgca 

taattaaaga 

aacatagggg 

aaataaatat 

gatgttggga 

tttaagactt 

tagcaaatca 

attttaaaca 

taagtgtatt 

aagaaaaatt 

gagtggttag 

agggatactg 

gagggtaata 

ccgcagattg 

aataaattta 



gaatgagcag 

tgcgagggcg 

taatgattcc 

gaaggtcgat 

ttatgccgag 

aagtagtagc 

gcgcatcaca 

cctttttgag 

ggtactacga 

tgggttttta 

ggtacagtga 

acaacagaag 

gatgcaaccg 

aaaaaagata 

agcaccgtta 

caagaggacg 

agtatttaaa 

atacagaagg 

aatcttatga 

caggaagtgt 

aatatcaaaa 



tgttgatgaa ttattaaaga aaacaacaac 



gatgaacaag 
gatgattggt 
tcaaatcggt 
acactctttt ttcgcaccag 



tgagtttatt 
gtttaaattt 
atatataaaa 



gaatcgtaga 
tcccgctttt 
ataaatctga 
cgaaaactgg 



ttgaatgtat 

aatttccctg 

aaatttgagc 

agaaagcgtg 

aaaacttttt 

gacagctatt 

cgcaaaaagg 

gtcttttttt 

cctcccccct 

atgttaaaag 

tggttgtaga 

aaaatgagat 

gtattctaat 

gacagcactg 

ttatatcatg 

taattggaca 

agagttacac 

.taggatgaaa 

acagataaaa 

gaaaggtctt 

agaagatatg 

agatagatat 

atttaagagt 

atgtgataac 

ccgatagatt 

tttaagccga 



cagagaaatt 

ggtttgagaa 

gtgtgggaca 

agaaacagcg 

gcgtgtgaca 

aactttcggt 

aaattggaat 

cttagatttt 

aggtgtccat 

gttgtttttt 

tagaaaagaa 

attaaataga 

aaaaaaatat 

gcatgctgcc 

agcgaaaatg 

tttgttrtat 

attcaatttg 

aaagagcatt 

ataattacag 

gtgagatata 

atagtttatg 

aaattaatta 

ttaatggatt 

tcggcgtatg 

ttgaatttag 

ctgcgcaaaa 



tcaagctagt 

agggtttaat 

gtttatggat 

tacagacgat 

gtccttaaaa 

ttcaaagctc 

aaatgcgaaa 

tggggttatt 

tgtccattgt 

atgttaaagt 

gagaaaaaag 

atcaaagaaa 

gcaaaggagg 

tatctatgac 

taataaaaga 

atccagaatc 

tagtgtctcc 

atcatattct 

aagaattgaa 

tgcttcacat 

gcggtgtaga 

aagaaatgat 

atgcaatgaa 

ttattcaaga 

gtgtcacaag 

gacataatcg 



actctagagg atccccggg 



4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520- 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6079 
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